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DIAGNOSTIC METHOD 

The inventors have discovered a human gene linked to susceptibility to inflammatory 
bowel disease (JED) using linkage and association analysis, The present invention therefore 
5 rebates to diagnostic techniques for the detection of IBD, and for determining a patient's 
susceptibility to develop IBD by detecting all or part of this gene, its precursors or products 
(eg. iriRNA, cDNA, genomic DNA, or protein). The invention is also directed to methods 
for identifying modulators of IBD, which modulators, such as chemical compounds, antisense 
molecules and antibodies modulate the gens identified. 
10 Background to Invention 

Inflammatory bowel disease QEB) is characterised by a chronic relapsing intestinal 
inflammation of the gastrointestinal tract. It affects -1/1,000 individuals in Western countries 
with the median age of onset in early adulthood To date, the etiology of this disease is 
unknown, Based on clinical and histopathologic^ features. IBD is categorised into two main 

15 subtypes, Crohn's disease (CD) (On Line Mendelian Inheritance in Man - a database 
produced by Johns HopkinB University available at NCBI, OMIM 266600) and ulcerative 
colitis (HQ (OMIM 191390). Although the cause of IBD is unknown, both familial clustering 
of the disease and increased concordance in monozygotic twins shows a strong genetic 
susceptibility. Estimates of sibling risk (Xs) show a range of 10-50, suggesting that genetic 

20 factors play a significant role in the predisposition to IBD. In the present context the term IBD 
is intended to include IBD, as well aB Crohn's disease and ulcerative colitis. 

Previous genome wide linkage analyses have identified a number of susceptibility locus 
for IBD, e.g. 1BD1 (OMIM 266600) (Hugot * aU Nature, 379:821-823, 1996; Brant et al, 
Gastroenteriogy 115:1056-1061, 1998; Currane/ aL t Gastroenterology 115:1066-1071, 1998; 

25 and, Hampe et aL t Am J. Hum. Genet 64:808-816, 1999a), IBD 2 (OMIM 601458) (Duett et 
ah, Am, J. Hum Genet 63:95-100, 1988; and, Paikes etal., Am. J. Hum. Genet 67:1605- 
1610, 2000), IBD3 (OMIM 604519) (Hampe et al. Am. J. Hum, Genet 65:1647-1655, 
1999b), IBD7 (OMIM 605225) (Cho etal., Proc. Nat Acad. ScL 95:7502-7507; and, Cho et 
(A. t Hum. Molec. Genet 9: 1425-1432, 2000). 

30 There Is therefore a desire to identify genes with a significant association to the 
development of IBD. This may enable the development of novel therapies for IBD by 
screening for compounds and other entities, such as antibodies, which modulate the activity of 
the proteins encoded by the associated genes. Knowledge of the sequence of the associated 
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^smay also ^fc*^^ atUigenetnethofc to adulate tha express 
tot^tlBD. The discovery of associated genes may also as sis t in devBlopj^g novel metbodB 

S polymorphism (SNPs), (ii) measuring the levels of the Scribed mRNA present m affected 
to8Re ° r m meMcrin g thfi ^els of the protein in affected tissue, it is possible that the 
diagD08iS ° fIBD ' ° r the F" 5 **** of opposition to IBD, by these method, may bo 
achieved in patients who do not yet display the classical symptoms of the disease. Such 

^nnination of suscepti bility to IBD or the early detection of d ise^ development may lead 

10 to earlier clinical intervention than is currently possible and may lead to more eSc^e" 

treatment of the disease. 

116 ***** kvBntion 18 ***** on ™ discovery of an association with IBD for a single 
gene termed d%5 located on chromosome 10q22.3. 

As used herein, the gene is referred to as the dlgS gene. Specifically, the cDNA 
. IS sequence is shown in SEQ ID No: 1. Encoded protein is shown in SBQ ID No: 2 and is 
referred to as DLGS, 

DL<55 belongs to the so-called MAGUK fbmily of proteins (Membrane Associated 
BM *" Iamao '' revtovred ta DfateMos « al BioBasav. 2 1*12*21, 1999). Proteins of 

20 »™el PDZ dotaafca (p,,,,,^ ^ , s ^ ^ ^ ^ ^ 

SHSdonamfsrchomoloffl.domama). mj^^aB^^^ 
■neda*, prceenvprotoin interaction* In several caseB where PDZ domain interactiona have 

. ^^'^ttoyha™l^al B ™toi IIt c^wia te «C* n ^ 

» ^^"^^Itorichau^^o,^,,,,,,^.^^ e 
bnase domain to mm MAOUK proteins tan been sta™ to p^.^ 

a^oo. wMle U !aoaa ^ ae*i W U i, Mmd ^ ^ ^ ^ 

d* oon^is to„^a teprott n>p^ 

concede, seaflbld prototas, c.ohe,,^ mto ^ to ^ memtnmc 

30 location.. Beades.s.abli.lnnantof oaBpo^of MAGUK proteins bnve afco been 

bnpbcetod in eatablirtmen, of posaynaptiu compart^ m ^ 

■ 2:5 "- S86 antateracaonbeuseen.PDZ domain ofthe 

.... ^^^^l^Sand^lnn-ace^uno^^,^^^ 
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identified. Since it has been identified that the CMennmi of some 50 mtracelhdaf and 
membrane proteins have Ugh affinity for the VDZ domains of hDLG-l/PSD-95, it has been 
hypothesised that such clustering of scaffold proteins to multiple membrane receptors is 
responsible for localisation of neuroreceptors at postsynaptic sites (Kreienkamp, Curr. Opin. 

5 Phatm, 2:581-586, 2002). 

Until recently, little has been known about the function of DLG5. Partial EST sequences 
for dlg5 were identified from a database search by its similarity to other MAGTJK proteins 
(Nakaniura et al. r EBBS 433:63-457, 1998). The authors identified a partial cDNA sequence 
referred to as P-dlg and showed by inmwnostaining flat the protein was expressed in 

10 epithelial gland cells of the prostate. It was also shown by a two-hybrid screen that the 
DLG5/P-dlg specifically interacted withp55, another MAGUK protein. Northern blot 
analyses showed variable expression in multiple tissues (Nakamnra et al, FBBS 433:63-67, 
1998 and Shah et al., BMC Genomics 3:6 2002). Shah et al also showed that the human 
gene consisting of 32 exons, encoded a full length DLG5 protein of 1809 amino acids. Hie 
15 DLG5 protein contains 4 PDZ domains followed by an SH3 domain and a G terminal 
guanylate kinase domain. 

It was recently shown that DLG5 could be identified in a two-hybf id screen using 
vinexin as bait (Wakabayashi et al y JBC, 25th March 20032003). Furthermore, the authors 
- showed that vinexin, DLG5 and 0-catemn could form a ternary complex, providing a direct 
20 link to the adhesion junction complex in epithelial cells. 

In vertebrate gut epithelial cells three types of cell junctions are formed (reviewed in ' 
Tsukita et al,. Nature Rev. MoL Cell Biot 2:285-293, 2001). light junctions are located . 
towards the apical border of the basolateral side and are considered to function bom as a 
barrier for the extracellular environment as well as a fence for membrane diffusion. 
25 Adherence junctions are formed immediately basolateral of tight junctions and their role is 
less clear than for tight junctions. They are considered to be important for the mechanical 
strength of cell contacts, but it is also clear that their regulation has to be precisely 
coordinated with tight junctions, ibr example when immune cells passes through the epithelial 
barrier. The mutual dependence between tight junctions and adherence junctions are 
30 underscored by the fmdings that while formation of tight junctions does not occur until 
adherence junctions are intact, adherence junctions can not form when formation of tight 
junctions are inhibited by overexpression of a dominant negative mutant of the tight junction 
MAGUKproteinZO-3CWittchen« a Z. l J.CelLBioi 151:825-836, 2000; and refs merein). 
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Finafly, multiple desmosornes are located along the basolateral sides and are mainly 
considered to contribute to the mechanical strength of cell contacts. 

Many proteins have been shown to be localised to cellular junctions. Membrane 
proteins, such as for example occhidm and claudina, are found at tight junctions while 
5 i^mb^ofthecaoTjerm family mediates ceu^eH contacts at adhere 

»umb» Of proteins connect to the cytoplasmatic side of these membrane proteins, linking the 
complexes both to the actin cytoskeleton and to intracellular signaling. At adherence 
junctions, the cytoplasmatic part of cadherin binds p-catenin, thus proving a link between 
DI/3S and adherence junctions. 



10 The background data above strongly supports a functional role for DLG5 in gut ' 
epithelial cell function and integrity. The inventors propose that proteins) encoded by the 
digs gene, whichhas only now been identified as being genetically linked to susceptfoOity to 
IBD, are directly or indirectly involved in the pathogenesis of gut inflammation. 

The inventors have identified 20 unique nucleotide variations within the dlg5 gene, four 

15 of these result in codon changes, a further two are deletions. 
Summary of die inventing 

The inventors have identified a gene located on chromosome 10q22.3, termed dlg5, 
which demonstrates genetic association linkage to susceptibility to IBD. The gene, mRNA 
(or cDNA prepared therefrom) and protein sequences corresponding to such transcript are 

20 therefore diagnostic or prognostic markers of IBD, and can be used to design specific probes, 
or to generate antibodies, capable of detecting the presence of nucleotide sequence 
polymorphic or mutations of the gene or mRNA, or of measuring the levels of the mRNA or 
encoded protein present in a test sample, such as a body fluid or cell sample. In addition the 
gene and protein encoded thereby is a potential target for therapeutic intervention in IBD 

25 disease, for instance in the development of antisense nucleic acid targeted*, the mRNA, or 
transgenic therapies; or more widely in the identification or development of chemical or 
hormonal therapeutic agents. The person skilled in the art is also capable of devising 
screening assays to identify compounds (chemical or biological) that modulate (activate or 
inhibit) the identified gene or encoded protein, which compounds may prove useful as 
30 therapeutic agents in treating or preventing IBD. .. 
Detailed de^frtir^ of the invanfi^h 

According to a first aspect of the invention there is provided a method for identifying a 
compound capable of modulatiug the action of the DLG5 protein which method comprises 
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^bjecting one or more test compounds to a screen comprising a polypeptide containing the 
amino acid sequence shown in SBQ ID NO: 2, or a hamologde (hereof or a fragment of either. 

Tie term "fragment 1 ' as used herein refers to a subsequent of the foil length sequence 
that comprises at least 25, preferably at least 50, wore preferably at least 100 consecutive 
5 amino acids of the sequence depicted in&BQID NO: 2, preferably the fragment is a ' 
polypeptide that is the DLG5 protein with either or bothC-termmal and N-terminal 
truncations. 

It is understood that the polypeptide for use in the invention may be both a fragment and 
a homologue of the DLGS protein. 
10 hi a preferred embodiment, the screening methods of the invention are carried out using 
a polypeptide comprising an amino acid sequence as depicted in SBQ ID NO 2, or a sequence 
possessing, hi increasing order of preference> at least 80%, 85%, 90%, 95%, 97%, 98% and 
99% amino acid sequence itentity thereto. Such variants are herein referred to as 
"homologues". 

IS The sequence identity between two sequences can be deteonined by pair-wise computer. 
aBgmnent analysis, using programs such as, BestFit, Gap or RaimAlign, The preferred .. 
alignment tool is BesfPit. In practise, when searching for similar/identical sequences to the 
query search, from within a sequence database, it is generally necessary to perform an initial 
identification of similar sequences using suitable software such as Blast, Blast2, NCBI Blast2, 

20 WashU Blast2, FastA, Fasta3 and PILEUP, and a scoring matrix such as Bbsum 62. Such 
software packages endeavour to closely approximate the "gold- standard" alignment algorithm 
of Smith^Waterman. thus, the selected softWsearch engine programme for use in . 
assessing id^tity/simflarity, Le how two primary polypeptide sequences line up is Smith- 
Waterman. Identity refers to direct matches, similarity allows for conservative substitutions. 

25 AUeUc V9riaDtfl or of the DLGS protein may exist within the human population, 

particularly between distinct ethnic groupa. A former aspect of the invention involves the 
selection and use of me appropriate version of the DLG5 protein to be included in screens so 
as to discover compounds capable of altering the activity of said DLGS version in vivo. The 
inventors have identified four codon changing nucleotide polyo^rprnsms within one or other 

30 exons of dTg5 gene , each of these, alone or in combination, would provide numerous allelic 
variant protein versions of DLGS for use in any aspect of the present invention, 

Investigators may wish to screen their compounds against the most prevalent version of 
the DLGS protein and also against the less frequent versions of the DLG5 protein in order to 
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detect m differential pharmacological activity between the various versions of the target A 
further aspect of the invention is the screening of various ethnic based populations to measure 
the allele frequencies of die nucleotide polymorphisms in the dlg5 gene within said 
populating. This information may be of value in estimating the efficacy of new compounds 
5 capable of altering the activity of DLG5 within these populations and in particular in 
estimating the proportion of the population which may not respond to the therapy. 

Polymorphism refers to the occurrence of two or more genetically determined 
alternative alleles or sequences within a population. A polymorphic marker is the site at 

divergence occurs, Preferably markers have, at least two alkies, each occurring at 

10 frequency of greater than 1%, and more preferably at least 10%, 15%, 20%, 30% or more of a 

selected population. 

Single nucleotide polymorphisms (SNP) are generally, as the name implies, single 
nucleotide or point variations that exist in the nucleic acid sequence of some members of a 
species. Such polymorphism variation within the species, is generally regarded to be the 
15 xesfultof spontaneous mutation throughout evolutbn. The mutated and normal sequencesco- 
exist within the species' population sometimes in a stable or quasi-stable equilibrium At 
other times the mutation may confer some advantage to the species and with time may be 
incorporated into the genomes of all or a majority of members of the species. 

Some SNPs occur in the protein coding sequences, in which case, one of the 
20 polymorphic protein forms may possess a different amino acid which may give rise to the 
expression of a variant protein and, potentially, a genetic disease. These changes in function 
may be mediated by several mechanisms including, but not limited to, alterations in protein 
folding,- alterations in ligand and substrate binding affinity and alterations in membrane 
binding affinity and may lead to gain of activity or loss of activity for the protein in vivo. 
25 Such alterations in the activity of the protein in vfv» may be of clinical significance in the 
development of BD. Alteration to the amino acid sequence of the protein may also affect the 
efficacy of drug therapy for H3D by altering the specificity between protein and compounds 
selected by screening to modulate its activity. Thu 8 compounds selected by screening may 
have different efficacies in modulating the activity of protein in different individuals 
• 30 -according to the versions of the gene that they carry. In particular an individual who is 
homozygous for aless common variant of the gene may not respond wen to a therapy 
developed by screening compounds against the dominant variant. 
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sppropnate substrate into contact wllhtha polypeptide, or a cell capable of producing it, or a 

5 wifh standard techniques. 

Any compound identified ta this way may prove ^ in the treatment oflBD in 
l™and/oTother animals. The Wion thus extends to * compound selected through its 
ability to regain the activity of the DLG5 protein to vrva as primarily deternnned in a 
screening assay utilising the polypeptide containing an amino acid sequence shown in SEQ 
10 ID NO: 2, or a homologue or fragment thereof, or a gene coding therefore (such as that 
disclosed in SEQ ID NO: 1) for use in the treatment of a disease in which the over- or under- 
activity or unregulated activity of the protein is implicated. 

According te a further aspect of the invention there is provided a screening assay or 
-thod for identifying potential anti-IBD therapeutic compounds conprising contacting an . 
•15 assay systemoapabletfdetec^^ 

DLG5, with a test compound and assessing the change in expression level of DLG5. 

Compounds that modulate the expression ofDNA or RNA of DLG5 polypeptides may 
be detected by a variety of assay systems. A snltahle assay systemmay he a simple 'Wno» 
M ^»*^wl»tber^ 
20 galactesidas,^ 

*«(reviewedbyNaybr,Bbc^ 1999). The assay system may be 

made quantitative by comparing the expression or function of a test sample with the levels of 
ex^ss^orfuncrionmastar^sa^le. Systems in which transcription factors areused 

tran^tfonfactoxtosthnnlateaneg^ 

as a "reverse one-hyhrid system" (Vidal et al, 1996, supra). Therefore, m an embodiment of 
the present invention, a reporter gene is placed under the control of &e dlg5 promoter. A 
suitable dlg5 promoter sequence is disclosed in SEQ ID NO: 5. 
30 In a further aspect of the invention we provide a cell or cell line comprising a reporter - 
gene under the control of the dlg5 promoter. 

According to another aspect of the present invention there is provided a method of 
ficreenmg for a compound potentially useful for treatment of IBD, which comprises assaying 
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the conqxrund for its ability to modulate the activity or amount of DLGS. Preferably the 
assay is selected from 

(9 measurement of DLGS activity using a cell line which, expresses thp DLG5 polypeptide or 
using purified DLGS polypeptide; and 
5 . (il) measurement of dlgS transcription or translation in a cell line expressing the DLG5 
polypeptide. 

The "DLGS polypeptide" refers to the DLG5 protein, a homologue thereof, or a 
fragment of either. 

Thus, in a further aspect of the invention, cell cultutes expressing the DLG5 polypeptide 



10 can "be used in a screen for therapeutic agents. Effects of test cpn^cru^s'm^y'bsra&sayed- 1 
changes ininRNA or protein of DLG5. As described below, cells (Le. mammalian, "bacterial, 
yeast etc.) can be engineered to express the DLG5 polypeptide. 

Thus, according to a further aspect of the invention there is provided a method of testing 
potential therapeutic agents for the ability to suppress TED phenotype comprising contacting la 
IS test compound with a cell engineered to express the DLGS polypeptide; and determining 
whether said test compound modulates expression of the DLGS polypeptide. 

We also provide a method for identifying inhibitors of transcription of dlg5, which 
. method comprises contacting a potential therapeutic agent with a cell or cell line as described 
above and determining inhibition of dlg5 transcription by the potential therapeutic agent by 
20 reference to a lack of or reduction in expression of the reporter gene. 

Any convenient test compound or library of test compounds may be used in conjunction 
with the test assay. Particular test compounds include low molecular weight chemical 
compounds (preferably with a molecular weight less than 1500 daltons) suitable as 
ptamaceutical or veterinary agents for human or animal use, or compounds for non- 
25 administered use such as cleaning/sterilising agents or for agricultural use. Test compounds 
may also be biological in nature, such as antibodies. 

According to a further aspect of the invention there is provided a compound identified 
by a screening method as defined herein. 

According to another aspect of the present invention tjtere is provided use of a 
30 compound able to modulate the activity or amount of DLG5 in the preparation of a 

medicament for the treatment of IBD. Modulation of the amount of DLGS by a compound 
may be brought about for example through altered gene expression level or message stability. 
Modulation of the activity of DLG5 by a compound may also be brought about for example 
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through compound binding to the DLG5 protein. In one embodiment, modulation of DLG5 
comprises use of a cornpound able to reduce the activity or amount of DLG5. In another 
embodiment, modulation of DLG5 comprises use of a compound able to increase the activity 
or amount of DLG5. 

5 It will be appreciated mat the team 'for the treatment of IBD\ and variations thereon, 
includes therapeutic and prophylactic (preventative) treatment 

According to another aspect of the present invention there is provided a method of . 
preparing a pharmaceutical coinposition which comprises: 

i) identifying a compound as useful for treatment oflBD according to a screening method as 
10 described herein; and 

ii) mixing the compound or a pharmaceutical^ acceptable salt thereof with a 
pharmaceutioally acceptable excipient or diluent 

Ac<fcrding to a further aspect of the invention there is provided a method of treatment of 
a patient suffering from EBD comprising administration to said patient of an effective amount 
15 of a compound identified according to a screening method of the invention or a composition 
prepared by (he method described herein. 

The compositions of the invention may be in a form suitable for oral use (for example as 
tablets, lozenges, hard or soft capsules, aqueous or oily suspensions, emulsions, dispersible 
powders or granules, syrups or elixirs), for topical use (for example as creams, ointments, 
20 gels, or aqueous or oily solutions or suspensions), for adnnnistration by inhalation (for 

example as a finely divided powder or a liquid aerosol), for administration by insufflation (for 
example as a finely divided powder) or for parenteral administration (for example as a sterile 
aqueous or oily solution for intravenous, subcutaneous, intramuscular or mtratnuscular dosing 
or as a suppository for rectal dosing). 
25 The compositions of die invention may be obtained by conventional. procedures using 
conventional pharmaceutical excipients. well known in the art Thus, compositions intended 
for oral use may contain, for example, one or more colouring, sweetening, flavouring and/or 
preservative agents. 

Suitable pbannaceutically acceptable excipients for a tablet formulation include, for 
30 example, inert diluents such as lactose, sodium carbonate, calcium phosphate or calcium 
carbonate, granulating and disintegrating agents such as com starch or algenic acid; binding 
agents such as starch; lubricating agents such as magnesium Btearate, stearic acid or talc; 
preservative agents such as ethyl or propyl p-hydroxybenzoate, and anti-oxidants, such as : 
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ascQibicacid. Tablet fommktionfl may be uncoated or coated either to modify their 
disintegration and the subsequent absorption of the active ingredient withiu the 
gastrointestinal track, or to improve their stability and/or appearance, in either case, using 
conventional coating agents and procedures well known in the art. 

Compositions for oral nse may be in the form of hard gelatin capsules in which the 
active ingredient is mixed with an Inert solid diluent, for example, calcium carbonate, calcinm 
pho sphate or kaolin, or as soft gelatin capsules in which the active ingredient is mixed with 
water or an oil such as peanut oil, liquid paraffin, or olive 0 iL 

Aqueous suspensions genera lly contain the active ing redient in finely D owd ere rf form 



' — — — - ... JL — ***** yiii> *ii LlXJXZXjf pUWUtUCU 

10 together with one or more ^spending agents, such as sodium carbosymethylceUulose, 

methylcellubse, hydroxypropylme^ylceUulose, sodium alginate, polyvinyl-pyrrolidone, gum 
tragacanth and gum acacia; dispersing or wetting agents such as lecithin or condensation 
products of an alkylene oxide with forty acids (for example polyethylene 8teara te), or 
condensation products of ethylene oxide with long chain aliphatic alcohols, for example 

15 ^t^^yk*^^ 

derived from fatty acids and a hexitol such as polyoxyethylene sorbitol monooleate, or 
condensation products of ethylene oxide with long chain aliphatic alcohols, for example 
heptadecaethyleneoxycetanoL or condensation products of ethylene oxide with partial esters 
derived from fatty acids and a hexitol such as polyoxyethylene sorbitol monooleate, or 

20 <*^*ioh products of et^^ 

hexitol anhydrides, for example polyethylene sorbitan monooleate. The aqueous suspensions 
may also contain one or more preservatives (such as ethyl or propyl p-hydroxybenzoate, and- 
oxriauts (Buch as ascorbic acid), colouring agents, flavouring agents, and/or sweetening 
agents (such as sucrose, saccharine or aspartame). 

25 (My suspensions may be formulated by spending the active ingredient in a vegetable 
on (such as arachis oil, olive oil, sesame oil or coconut oil) or in a mineral oil (such as uquid 
paraffin), The oily suspensions may also contain a thickening agent such as beeswax, hard 
paraffin or cetyl alcohol Sweetening agents suchas those set out above, and flavouring 
agents may be added to provide a palatable oral preparation. These compositions may be 

30 preserved-by the addition of an anti-oxidant such as-ascorbic acid. 

Dispersible powders and grannies suitable for preparation of an aqueous suspension by 
the addition of water generally contain the active ingredient together with a dispersing or 
wetting agent, suspending agent and one or more preservatives. Suitable dispersing or 
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wettfag agents aud spending agents are exemplified by those already mentioned above 
Additional extents suchas sweetening, flavouring and colouring agents, may also be 
present. 

The pharnaocutfcal compositions of the invention may d» be Intheformof 
5 ofl-m-wateremuMona. TlKOily p hMonmyteavegaal)boaiS11£hasol . YeoilormoM8oit 
or 8 mmera] oil, such as for example liquid paraffin or a mixture of any of these. Suitable 
"tetafrmg asm* may be, fi* a^fe, oamnfly.^^ guna, such ae^ acacia or *rm 
tragacarth, naturaUy^carttoii phoaphaUdea euch as soya bean, feoithin, an or partial 
eater, derived ftomfctty acid, end hexitol anhydrides «br example sorbtamonooleae) ami 

sorbtatl namooleate. Ha amnfekm. may also conMn sweetenmg, fhm.uring and 
preservative agents. 

Syrups and elixirs may be formulated with sweetening agents such as glycerol 
propylene glycol, sorbitol, aspartame or sucrose, and may also contain a demulcent 
15 preservative, flavouring andVor colouring agent. 

The pluuTuaceutical compositions may also be in the form of a sterile injectable aqueous 
or oHy suspension, which may be formulated according to known procedures using one or 
»ore of the appropriate dispersing or wetting agents and suspeuding agents, wMchhave been 
ment^d above. A sterile injectable preparation may also be a sterile injectable solution or 
20 sn^ensfonmanon-to^ 
l»3-outanediol 

Suppository foamuktions may be prepared by mixing the active ingrolient with a 
suatable non-irritating excipient, which is solid at ordinary tomperatures but liquid at the rectal 
^t^mt^tt^^^^^^^^ Suitableexcipients 
25 include, for example, cocoa butter and polyethylene glycols. 

Topical formulations, such as creams, ointments, gels.aud aqueous or oily solutions or 
suspensions, may generally be obtained by formulating an active ingredient with a 
conventional, topically acceptable, vehicle or diluent using conventional procedure well 
known in the ait 

30 Compositions for administration by insufflation may be in the form of a finely divided 
powder containing particles of average diameter of, for example, 30u or much less, the 
powder itself comprising either active ingredient alone or diluted with one or mote 
Physiologically acceptable carries such as lactose. The powder fox insufflation is then 
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convenienily retained in a capsule containing, for example, 1 to 50mg of active ingredient for 
use with a turbo-inhaler device, such as is used for insufflation of the known agent sodium 
CTOmoglycate. 

Compositions for administration by inhalation may be in the form of a conventional 
5 pressurised aerosol arranged to dispense the active ingredient either as an aerosol containing 
finely divided solid or liquid dmpleta, Conventional aerosol propeHantB such as volatile 
fluorinated hydrocarbons or hydrocarbons may be used and the aerosol device is conveniently 
arranged to dispense a metered quantity of active ingredient, 

FoT farther info rmation on Formulation the reader is referred to Chapter 25.2 in Volume 

10 5 of Comprehensive Medicinal Chemistry (Cfcrwin Hansen; Chanroanof BlBtoflal"BIBrsa)~ 
Pergamon Press 1990. 

The amount of active ingredient that Is combined with one or more excipients to 
produce a single dosage form wffl necessarily vary depending upon the host treated and the 
particular route of administration. For example, a formulation intended for oral 

15 administration to humans will generally contain, for example, from 0.5 mg to .2 g of active 
agent compounded with an appropriate and convenient amount of excipients which may vary 
from about 5 to about 98 percent by weight of the total composition. Dosage unit forms will 
generally contain about 1 mg to about 500 mg of an active ingredient. For former information 
on Routes of Adnnnistration and Dosage Regimes the reader is referred to Chapter 25.3 in 

20 Volume 5 of Comprehensive Medicinal Chemistry (Corwin Hansen; Chairman of Editorial 
BIBDrd), Pergamon Press 1990. 

The size of the dose for therapeutic or prophylactic purposes of a compound will 
naturally vary according to the nature and severity of the conditions, the age and sex of the 
animal or patient and the route of administration, according to well known principles of 

25 medicine. 

In using a compound for therapeutic or prophylactic purposes it will generally be 
administered so that a daily dose in the range, for example. 0,5 mg to 75 mg per kg body 
weight is received, given if required in divided doses, In general lower do sea wffl be 
adniinistered when a parenteral routs is employed. Thus, for example, for intravenous 
30 administration, a dose in the range, for example, 0.5 mg to 30 mg per kg body weight will 
generally be used. Sunilarly, for administration by inhalation, a dose in the range, for 
example, 0.5 mg to 25 mg per kg body weight will be used. Oral admimstration is however 
preferred. 
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Having identified that the d|gS gene is implicated in IBD, this presents many molecular 
diagnostic opportunities. It is known to persons skilled in the art that clinically significant 
information may be obtained by the measurement of the levels of nucleic acids, proteins or 
other analytes that occur within biological samples. When nucleic acids, proteins or other 
5 analytes occur in polymorphic form then there may also be diagnostic utility in by identifying 
which of the various versions of said polymorphic nucleic acids, proteins or other analytes 
occur Within a sample. 

An investigator may wish to measure the levels of DLG5 protein or to measure the 
levels of digs mRNA transcript present in a sample, An investigator may also wish to 
10 perform nucleic acid sequence analyses to detect variant nucleotides present within the 
sample, these analyses may be performed on either the DNA or RNA fraction of the sample 
and are well known to the person skilled in the art. An investigator may also wish to perform 
protein sequence analysis either directly, by degradation based techniques which are well " 
known in the art, or indirectly by molecular recognition techniques including immunoassay, 
15 or by techniques based on detecting changes in the. physical characteristics of the protein such 
as functional or substrate specificity assays or iso-electric focusing. 

According to a further aspect of the invention there is provided a method for diagnosing 
or prognosing or monitoring tBD, comprising testing a biological sample for aberrant levels 
ofDLGS. 

20 The term "aberrant levels" refers to levels that are outside the normal range. The normal 
range can be determined by testing many normal tissues or may be determined from a side by 
side comparison of the test sample with the normal or control sample. For the purposes of 
this application, aberrant expression refers to a 1.5 -fold difference or more in level of nucleic 
acid in a disease sample compared to control normal. Nucleic acid as used herein refers to 

25 both RNA and DNA 

The test biological sample is conveniently a sample of sinovial fluid, blood, buccal 
scrape, urine or other body fluid or tissue obtained from an individual. 

The invention lies in the identification of the gene identified herein being linked to IBD 
disease prevalence. Accordingly , in part, the invention is directed to any diagnostic method . 
30 capable of assessing the differential expression level, relative to expression in control tissues, 
of the dlg5 gene identified herein, either alone or as a paneL In particular, such methods 
include assessment of mRNA transcript levels and/or protein levels, The presence of aberrant. 
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expression levels of the gene indicating the presence oflBD or an increased likelihood to 
develop the disorder. 

As noted above, in one embodiment die diagnostic/detection methods of the invention 
are employed to detect the presence of one or more SNPs or small insertions, deletions or 
5 duplications of DLG5 or dlg5. Suitable SNPs and deletions of DLG5 or dlgS include those 
identified in Table 3. 

Knowledge of polymorphisms can be of assistance in identifying patients susceptible to 
particular diseases and those most suited to therapy with particular pharmaceutical agents (the 
latter is often termed 'pharmacogenetic s") . Pharmacogenetics can also be used in 

10 pharmaceutical research to assist the drug selection process. Polymorphisms are used in 
mapping the human genome and to elucidate the genetic component of diseases. The reader 
is directed to the following references for background details on pharmacogenetics and other 
uses of polymorphism detection; Linder et aL (1997), Clinical Chemistry, 43:254; Marshall 
(1997), Nature Biotechnology. IS: 1249; International Patent Applicatirin WO 97/40462, 

15 Spectra Biomedical; and Schafer et al, (1998), Nature Biotechnology. 1633; 

A haplotype is a Bet of alleles found at linked polymorphic sites (such as within a gene) 
on a single (paternal or maternal) chromosome. If recombination within the gene is random, 
there may be as many as 2 n hapbtypes, where 2 is the number of alleles at each SNP and n is 
the number of SNPs. One approach to identifying mutations or polymarphisms which are 

20 correlated with clinical response is to carry out an association study using all the hapbtypes 
that can be identified in the population of interest The frequency of each haplotype is limited 
by the frequency of its rarest allele, so that nucleotide sequence polymorphisms with low 
frequency alleles are particularly useful as markers of low frequency haplotypes. As 
particular mutations or polymorphisms associated with certain clinical features, such as 

25 adverse or abnormal events, are likely to be of low frequency within the population, low 
frequency nucleotide sequence variations may be particularly useful in identifying these 
mutations (for examples Bee: Linkage disequilibrium at the cystathionine beta synthase (CBS) 
locus and the association between genetic variation at the CBS locus and plasma levels of 
homocysteine (De Stefano et aL, Ann Hum Genet (1998) 52:481-90; and. Keightley et at, 
30 Blood (1999) 93:4277-83). 

Clinical trials have shown that patient response to treatment with pharmaceuticals is 
often heterogeneous. Thus there is a need for improved approaches to pharmaceutical agent 
design and therapy. 



BEST AVAILABLE COPY 



D. JUn. 2M'j 15:42 / ViLUBflL If fAltNIS NUj}//y r IB 

101073 

e • 

-15- 

Point mutations in polypeptides will be referred to as follows: natural amino acid (using 
1 or 3 letter nomenclature), position, new amino acid. For (a hypothetical) example "D25K" 
or "Asp2SLys" means that at position 25 an aspartic acid (D) has been changed to lysine (K). 
The presence of a particular nucleic acid base at a polymorphism position will be 
5 repreSentedby folto wing the polymorphismpositioa For (ahypo&etical) example, 

the presence of adenine at position 300 win be represented as: 300A. 

We provide examples of nucleotide polymorphisms, including those that affect the 
amino acid sequence of the DLGS protein. Such amino acid changing porymarphimB are . 
indicated in Table 3 as "hon-synonymous". 
10 Nucleotide polymorphisms (mutations) in the promoter and VJR regions may also 
affect the transcription and expression of the dlgS gene leading to either increased or 
decreased levels of expression or to unregulated activity of the DLGS protein in vivo. Such 
alterations in the level of expression of the DLGS protein in vivo may result in a gain or loss 
.offunctiorvwblchisofc^ Recently, it has been reported fliat even 

IS polymorphisms that do not result in an amino acid change can cause different structural folds 
of mRNA with potentially different biological functions (Shen et at, (1999) Proc Natl Acad 
Soi USA 96:7871-7876). 

m one embodiment of the invention the screening methods described herein utilise a 
• DLG5 protein variant which is transcribed from a nucleic acid sequence based on that shown 
20 inSBQIDNO:l. 

Nucleotide polymorphisms within dhj5 or DLG5 may also he used as diagnostic 
takers Of predisposition to disease. Genntyping nucleotide sequence variants in populations 
suffering fromlBD and in control populations not suffering from IBD but matched for factors 
including, but not limited to, racial ancestry, country of origin, sex, age and body mass index 
25 may allow investigators to identify increased risk factors associated with the development of 
IBD disease according to the inheritance of certain SNP genotypes or haplotypes which are 
more prevalent in populations with IBD compared to their incidence in the corresponding 
control populations. This may enable screening for individuals at increased risk of 
devetoping IBD by measuring the genotypes and haplotypes of these nucleotide sequence 
30 polymorphisms within non-symptomatic individuals. We have discovered novel sequence 

polymorphisms in the dlgS gene which may be useful for the diagnosis of IBD. Public domain - 
nucleotide sequence variations, which may also be useful for the diagnosis of IBD, ox for 
research into IBD, are also Identified herein Table 3 lists the sequence polymorphisms in 
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dlg5. Those database-derived polymorphisms present in the public domain are annotated as 
either is- or tso-. The other SNPs, annotated DLGe, are believed to be identified herein, for 
the first time. Tito- stands for the SNP consortium. 

A nucleotide sequence variation or polymorbJsms could be a single nucleotide 
5 polymoihism, adeletion of one or several nucleotides, a duplication of one or several 
nucleotides or an insertion of one or several nucleotides in the nucleotide sequence of the 
gene or in sequences modulating the expression of the dlgS gene. 

According to one aspect of the present invention there is provided a method for the 
diagnosis of a single nucleotide polymorphism associated with IBP, which method comprises 
10 <ktenniniiigfiWBm^^ 

to one or more of SEQ ID NOs: 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39. 
41, 43, 45, 47, 49, 51, 53, 55, 57. 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, B9, 
91 and 93; and determinmg the status of tne human by reference to polym^ 
Withrespect to the mutation disclosed in SBQ ID NO: 32 and 33, the nucleotide at position 
15 16 will either he C, or in the allele with the 7-base deletion, a G. 

The term human includes both a human having or suspected of having inflammatory 
bowel disease and an asymptomatic human who may be tested for predisposition or 
susceptibility to IBD, At each position the human may be homozygous for an allele or the 
human may be a heterozygote. 
20 In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism boated at position 16 (according 
to SBQ ID NO; 7) is the presence of Q and/or A. 

In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
25 to SBQ ID Na 9) is Iks present of C and/or T. 

In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SBQ ID NO: 11) is the presence of C and/or T. 

In one embodiment of the invention preferably the method for diagnosis described 
30 herein is one in which the single nucleotide polymorphism located at position 1 6 (according 
to SBQ ED NO; 13) is the presence of Q and/or A. 
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m one embodiment of the invention preferably the method for diagno sis described 
herein is one in which the single nucleotide polymorphism located at position 16 (accordins 
to SEQ ID NO: 15) is the presence of G and/or C 

In one embodiment of the invention preferably the method for diagnosis described 
5 herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 17) is the presence of C and/or T. 

La one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphic located at position 16 (according 
to SEQ ID NO; 19) is the presence of A and/or Q. 
10 In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 1 6 (according 
to SEQ ID NO: 21) is the presence of C and/or A. 

In one embodiment of the invention preferably the method for diagnosis described, 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
15 toSEQIDNO;23)i s thepresenceofCana7orA * 

m one embodiment of the invention preferably the method for diagnosis described 
herem is one in which the single nucleotide polymorphism located at position 16 (according 
to SBQ ID NO: 25) is the presence of G and/or A 

m one embodiment of the invention preferably the method for diagno^ described 
20 herein is one in which &e single nucleotide polymorph located at position 16 (according . 
to SEQ ID NO: 27) is the presence of C and/or T. 

In one embodiment of the invention preferably the method for diagnosis described • 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
toSBQroNOa^isthepresenceofCand/orG. 
25 ih one embodiment of the invention preferably the method for diagnosis described 

herein is one in which the single nucleotide polymorphism located at position 16 (according '• 
to SEQ ID NO: 31) is the presence of C and/or T. 

mone embodiment of the invention preferably the method for diagnosis described 
herein u one in which the single nucleotide polymorphism located at position 16 (according 
30 to SEQ ID NO; 35) is the presence of G and/or A 

fo one embodiment of the invention preferably the method for diagnosis described 
herefoisonemwhichthesfo^ 

toSBQIDNO:37)isthepresenceofGand/orC. 
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In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism locate* at position 16 (according 
to SEQ m NO: 39) fc the presence of G and/or A. 

In one embodiment of the invention preferably the method for diagnosis described 
5 herein is one in which the single nucleotide polymorphism located at position 16 (adding 
to SBQ ID NO: 41) is the presence of C and/or T. 

hi one embodiment of the invention preferably the method for diagnosis described 
herein Is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO : 43) is the presence of G and/or A. *' 
10 5 006 embodiment of the invention preferably the methodT&TdiagnoWaW^ 



herein is one in whioh the single nucleotide polymorphism located at position 16 (according 
to SEQ 3D NO: 45) is the presence of C and/or T. 

In one embodiment of the invention preferably the method for diagnosis described 
herdn is one in which the single nucleotide polymorphism located at position 16 (according 
15 to SEQ ro NO: 47) is the presence of C and/or T. 

In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO; 49) is the presence of C and/or T. 

In one embodiment of the invention preferably the method for diagnosis described 
20 herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 51) is the presence of C and/or T. 

In one embodiment of the invention preferably the method for diagnosis described 
hetein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ DD NO: 53) is the presence of Q and/or C. 
25 In one embodiment of the invention preferably the memod for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SBQ ID NO: 55) is the presence of C and/or T. 

In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at po sition 16 (according 
30 toSBQroNO:57)i8thepresenceofQand/orC. 

m one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ JD NO: 59) is the presence of C and/of T. 
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In one embodiment of the invention preferably the method for diagnosis described ' 
herein is one in which the single nucleotide rxnyrnorphism located at position 16 (according 
to SEQ ID NO: 61) is the presence of C and/or A. 

In one embodiment of the invention preferably the method for diagnosis described 
5 herein is one in which the single nucleotide polymorphism located at position 16 (wording 
to SEQ ID NO: 63) is the presence of T and/or A. " 

In one embodiment of the invention preferably the method for diagnosis described 
herein is one-* which the single nucleotide polymorphism located at position 16 (according 
. to SBQ m NO: 65) is the presence of C and/or G. 
10 In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 67) is the presence of G and/or A. 

ha one embodiment of the invention preferably the method for diagnosis Scribed 
herein is one in which the single nucleotide polymorphism located at position 1 6 (according 
15 toSEQ!DNO:69)isthBpresenceofCand/orT. 

In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according ■ 
to SEQ ID NO: 71) is the presence of A and/or G (as aresult of a single base deletion), 
to one embodiment of the invention preferably the method for diagnosis described 
20 herein is one in which the single nucleotide polymorphism located at position 16 (according . 
to SEQ ID NO: 73) is the presence of C and/or T. 

In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphic located at position 16 (according 
to SBQ ID NO: 75) is the presence of C and/or T. 
25 In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 77) is the presence of C and/or A." 

In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
30 toSBQIDNO^^isthepresenceofCanoVorT. 

In one embodiment of the invention preferably the method for diagnosis described 
herem rs one in which the single nucleotide polyroorphism located at position 16 (according 
to SBQ ID NO: 8 1) is the presence of C and/or T. 
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In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SHQID NO: 83) is the presence of G and/ox A. 

In one embodiment of the invention preferably the method for diagnosis described 
5 herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SBQE> NO: 85) is the presence of C and/or T. 

In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SBQ ID NO: 87) is the presence of A and/or G. 

~10 ■ froneernbodimemofik^ 

herein is one in which the single nucleotide polymorphism located at position 1 6 (according 
to SEQ ID NO: 89) is the presence of G and/or A : 

In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
15 to SBQ m NO:. 91) is the presence of C and/or G. 

In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 93) is the presence of G and/or A 

In one embodiment of the invention preferably the method for diagnosis described 
20 herein iB one in which there is the presence or absence of a 7-baae deletion located at position 
16 (according to SBQ ID NO: 33). - 

In another aspect, of the invention there is provided a method for the diagnosis of IBD or 
detemiining susceptibility to develop IBD, which method comprises: 
(i) obtaining a protein or nucleic acid containing' sample from an individual; 
25 (ii) detecting the presence or absence of a variant DLG5 on the basis of the presence of a 
polymorphic amino acid within the DLGS protein, or a polymorphic base within the dlgS 
gene sequence; and, 

(iii) dateimining the status of the human by reference to the presence or absence of a 
polymorphism in DLGS. 

30 In one embodiment the polymorphic amino acjM I is located at position 30, 121, 514, 957, 
979 or 1371 according to SEQ ID NO: 2, 

In a particular embodiment the polymorphism is selected from the group consisting of: 
Gln30Arg, Serl21Gly, Giu514Gln, Aig957Hi 9 /pro979Leu and Pro 1371 Gin. 
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BIOS Scientific Publishers Limited, 1097. 
20 Tabkl: Abbreviations: 



ALEX™ 



APBX 



ARMS™ 



b-DNA 



cmc 



Bp 



COPS 



ERBT 
LOT 



MASDA 
NASBA 



Ampupeatiop refractory mutation system linear 



Arrayed primer extension 



extension 



Amplication refractory mutation system 



Branched DNA 
Chemical mismatch cleavage 
base pair 




Competitive ongomtcleotide priming system 




Fluorescence resonance energy transfer 



Ligase chain reaction 



Multiple allele specific diagnostic assay 
Nucleio acid i 
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OLA 


Oligonucleotide ligation assay 


PCR 


Polymerase chain reaction 


FIT 


Protein truncation test 


RFLP 


Restriction fragment length, polymorphism 


SDA 


Strand displacement amplification 


SERRS 


Surface enhanced raman resonance spectroscopy 


SNP 


Single nucleotide polymorphism 


SSCP 


Single-strand conformation polymorphism analysis 


SSR 


Self sustained replication 


TGGB 


Temperature gradient gel electrophoresis 


3'UTR . 


3 " untranslated region 



List 1 - Mntatinm TlPti-iticm Techniques 

General DNA sequencing, Sequencing by hybridisation 

Scanning: FIT*, SSCP, DGGE, TGGB, deavase, Heteroduplex analysis, CMC. Enzymatic 
5 mismatch cleavage 

81 Note: not useful for detection of promoter polymorphisms. 

Hybridisation Based; Solid phase hybridisation: Dot blots, MASDA, Reverse dot blots, 
OKgonucleotJde arrays (DNA Chips) 

Solution phase hybridisation: Taqman™ - US-5210015 & US-5487972 CHof&nann-La 
10 Roche), Molecular Beacons - Tyagi et al (1996), Nature Biotechnology, 14, 303; WO 
95/13399 (Public Health Inst, New York) 

Extension Based: ARMS™- allele specific amplification (as described in European parent 
No. EP-B-332435 and US patent No. 5,595,890), ALEX™ - European Patent No, BP 332435 
Bl (Zeneca Limited), COPS - Gibbs et al (1989), Nucleic Acids Research, 17, 2347. 
15 Incorporation Based: Mini-sequencing, APEX 

Restriction Enzyme Based: RFLP, Restriction site generating PCR 
Ligation Based: OLA 

Other: Invader assay, Hybridisation protection assay 

20 List 2 - Signal Generation prDgt^ggn, fiyfrgn^ 

Fluorescence: FRET, Fluorescence quenching, Fluorescence polarisation - United Kingdom 
Patent No. 2228998 (Zeneca Limited) 
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0ther! ChemU ^^e, Bkctrocl^^^ RadioactMcy> Coloring 

Mass spectrometry, SERRS - WO 97/05280 (University of Strathclyde). 

5 SSR f NASBA,LCR,SDA,l>.]DNA 

Pitted mutation detection techniques include ARMS™-allele specific amplification, 
Taqman™, Mini sequencing, sequencing, RFLP, ALEX™, OLA, restriction site based PGR 
and FRET techniques. 

Particularly preferred methods include ARMS™-allele specific amplification, OLA and 
10 RPLPbassdme^ds, ARMS TI *-alleIe specific amplification is an especially preferred 
method. 

ARMS^-allele speciflc amplification (described in European patent No. EP-B-332435 
US patent No. 5,595,890 and Newton et al. (Nucleic Acids Research. Vol. 17, p.2503; 1989))! 
rekes on the complementarity of the 3 ' terminal nucleotide of the primer and its template 
15 ^3'tern^nuoleotideofthep^ ■ 
to the specific mutation, allele or polymorphism to be detected. There is a selectiva advantage 
for primer extension from the primer whose 3 ' terminal nucleotide complements the base 
mutation, allele or polymorphism. Those primers which haw a 3' terminal mismatch with the 
^ template sequence severely inhibit or prevent enzymatic primer extension. Polymerase chain • 
20 react,on or unidirectional primer extension reactions therefore result in product amplification 
when the V terminal nucleotide of the primer complements that of the template, but not, or at 
least not efficiently, when the 3' terminal nucleotide does not complement that of the 
template. 

It wai be appreciated that the test sample may equally he a nucleic acid sequence 
25 corresponding to the sequence In the test sample, that is to say that aU or a part of the region 
m the sample nucleic acid may firstly be amplified using any convenient technique e.g. 
polymerase chain reaction (PGR), before analysis. The nucleic acid may be genomic DNA or 
fractionated or whole cell RNA. In one embodimant theRNAis whole cell SNA and is used 
dnectly as the template for labelling a first strand cDNA using random primers or poly A 
30 pnmers. The nucleic acid or protein in the. test sample may be extracted fit>m the sample 
wording to standard methodologies (Sambmolc et aL 'Molecular Cloning- A Laboratory 
manual", second edition. Cold Spring Harbor, NY (1989)). 
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It will be apparent that the gene sequence disclosed for dlg5 (as depicted in SBQ ID NO: 
1) 3s a representative sequence, In normal individuals there are two copies of each gene> a 
maternal and paternal copy, which will likely have soma sequence differences, moreover 
within, a population there will exist numerous allelic variants of the gene sequence, indeed the 
5 Examples identify numerous SNPs and other mutations within dJgS gene that represent allelic 
variants within the population. It will be appreciated that the diagnostic methods and other 
aspects of this invention extend to the detection etc. of any of these sequence variants. 
Preferred sequence variants are those that possess at least 90% and preferably at least 95% 
sequence identity (nucleic acid or amino acid) to DLG5 depicted in SEQ ID No. 1 or 2. 

10 Nucleic acid sequence identity can also be gauged T^^^iBi^onstuaies whereby, under — 
stringent hybridisation and wash conditions, only closely related sequences (for example, 
those with >90% identity) are capable of forming a hybridisation complex. 
In a further aspect, the diagnostic methods of the invention, are used to assess the 
predisposition and/or susceptibility of an individual to IBD, and the present invention may be 

15 used to recognise individuals who are particularly at risk from developing IBD conditions. 
In a further aspect* the diagnostic methods of the invention are used in the 
development of new drug therapies, which selectively target one or more allelic variants 
identified herein, Identification of a link between a particular allelic variant and 
predisposition to disease development or response to drug therapy may have a significant 

20 iiqpact on the design of new drugs. Drugs may be designed to regulate the biological activity 
of variants implicated in the disease process whilst tinmiTnisfng effects on other variants. 

In a further diagnostic aspect of the invention the presence or absence of variant 
nucleotides is detected by reference to the loss or gain of, optionally engineered, sites 
recognised by restriction enzymes, The parson of ordinary skill will be able to design and 

25 implement diagnostic procedures based on the detection of restriction fragment length : 
polymorphism due to the loss or gain of oijje or more of the sites. 

The invention further provides nucleotide sequence information which can be used to 
design assays for detection of the polymorphisms of the invention. 

The invention further provides nucleotide primers, which detect the polymorphisms of 

30 the invention. _ 

The invention further provides nucleotide probes, which can detect the polymorphisms 
of the invention. 
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The amino acid sequence method for diagnosis is preferably one which is determined by 
imnmnological methods swh as^ e^ 

The levels of the DLG5 can be assessed from relative amounts erf mRNA, cDNA, 
genomic DNA or polypeptide sequence present in the test sample. Where RNA is used, it 
5 may be desired to convert the RNA to a complementary cDNA and during this process it may 
be desirable to incorporate a suitable detectable label into the cDNA 

In a preferred embodiment the method of the invention relies on detection of mRNA 
transcript levels. This involves assessment of the relative mRNA transcript levels of dlg5 in a 
sample, and comparison of sample data to control data. The gene transcript can be detected 
10 individually, or, is preferably detected amongst a panel of other disease-linked gene dlg5 from 
which a transcript profile can be generated Levels of dlg5 mRNA in the test sample can be 
detected by any technique known in the art. These include Northern blot analysis, reverse 
transcriptase-PCR amplification (RT-PCR), mlcroanay analysis and RNAse protection. 
In one embcdhnent, levels of dlgo RNA in a sample can be measured in a Northern blot 
15 assay. Here, tissue RNA is fractionated by electrophoresis, fixed to a solid membrane 
support, such as nitrocellubse or nylon, and hybridised to a probe or probes capable of 
selectively hybridising with the dlg5 RNA to be detected. The actual levels may be 
quantitated by reference to one or more control housekeeping genes. Probes may be used 
singly or in combination. This may also provide information on the size of mRNA detected 
20 by the probe. Housekeeping genes are genes which are involved in the general metabolism or 
maintenance of the cell, and are considered to be expressed at a constant level irrespective of 
cell type, physiological state, or stage in the cell cycle. Examples of suitable housekeeping 
genes are: beta actin, GAPDH, histone H3.3 or ribosomal protein L13 (Koehler et aL, 
Quantitation of mRNA by Polymerase Chain Reaction. Springer-Verlag, Germany (1995)). 
25 To gauge relative expression levels, a control sample can be run alongside me test sample or, 
the test result/value can be compared to dTgS expression levels expected in a normal or control 
tissue. Thes* control value* can be generated from prior test experiments using normal or 
control tissues, to generate mean or normal range values for dlg5. 

In another embodiment, the dlg5 nucleic acid in a tissue sample is amplified and 
30 quantitatively assayed. The polymerase chain reaction (PCR) procedure can be used to 
amplify specific nucleic acid sequences through a series of iterative steps including 
denaruratfon, annealing of oUgonucleotide primers (designed according to the sequence 
disclosed in SBQ ID NO. 1), and extension of the primers with DNA polymerase (see, fer 
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exampte, Muffls. et al, U.S. patent No. 4,6B3,202; Loh et al, Science 243:217 (1988)). In 
reverse tearaeriptase-PCR (RT-PCR) this procedure is preceded by a reverse transcription Btep 
to allow a large amplification of the number of copies of mRNA (Koshler et al , supra). 
Other known nucleic acid amplification procedures include transcription-based amplification 
5 systems <TAS) such as nucleic acid based sequence application (NASBA) and 3SR (Kwoh et 
al., Proc Natl Acad SciUSA 86:1173 (1989), Gingeras et al, PCT application WO 
88/10315), the ligase chain reaction (LCR, Bee European Application No. 320308), Strand 
Displacement Amplification (SDA), '"raceVtae sided PGR" and others (Frohman, PCR 

Protocols: a Guide to Methods and A pplicatjons. Academic Press, NY (1990); Oharaet si, 

10 Proc Nad Aoad ScL USA 86:5673-5677 (1989)). Quantitation of RT-PCR products can be" - 
done while the reaction products are building up esponentially , and can generate 
diagnostically useful clinical data. In one embodiment, analysis is carried out by reference to 
one or more housekeeping genes which are also amplified by RT-PCR. Quantitation of RT- 
PCR product may be undertaken, for example, by gel electrophoresis visual inspection or 
15 image analysis, HPLC (Koehier et aL, suprd) or by use of fluorescent detection methods such 
as intercalation labeling, Taoman probe (Higuchi et aL, Biotechnology 10:413-417 (1992)), 
Molecular Beacon (Piatek et aL, Nature BiotecbnoL 4:3S9-363 (1998)), primer or Scorpion 
primer CWhitoom.be et al., Nature Biotech 17:804-807 (1999)); or other fluorescence detection 
method, relative to a control housekeeping gene or genes as discussed above. 
20 Dlg5 RNA measurements can also be carried out on sinovial fluid, blood or serum 
samples. Preferably, the RNA is obtained from a peripheral bbod sample. In the case of 
soluble RNA in the blood serum, the low abundance of mRNA expected would necessitate a 
sensitive test such as RT-PCR (Kopreaki et al, Clin Cancer Res 5: 1961-5 (1999)). A whote 
blood gradient may be performed to isolate nucleated cells and total RNA is extracted such as 
25 by the Rnazole B method (Tel-Test Inc., Friendswotth, Tex.) or by modification of methods 
known in the art such as described in Sambrook et al, (supra). 

In a preferred embodiment of the invention, the diagnosis/deteotion method of the : 
Invention involves assessing dlg5 transcript levels using micro array analysis. Microarxay 
technology makes it possible to simultaneously study the expression of many thousands of 
- 30-genes in a single experiment. Analysis of gene expression in human tissue (e.g. biopsy tissue) 
can assist in the diagnosis and prognosis of disease and the evaluation of risk for disease. A 
comparison of levels of expression of various genes from patients with defined, pathological 
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dfcease conditions with norma] patients enables an expression profile, characteristic of 
disease, to be created. 



Probes are made that selectively hybridise to the sequences of the target dlgS gene in the 
testsample. These probes, perhaps together with other probes and control probes, are bound 
5 st discrete locations on a suitable support medium such as a nylon filter or microscope slide to 
form a transcript profiling array. The diagnostic method involves assessing the relative 
mRNA transcript level of dlgS in a clinical sample. This canbe done by radioactively 
labelling, or non-radioactivaly labelling the tissue mRNA, which can be optionally purified 
from total RNA, in any of a number of ways well known to the art (Sambroot et al, supra). 
10 The probes can be directed to any part or all of the target dlg5 mRNA. 

b another embodiment of the invention, total dlg5 RNA or DNA is quantified and 
compared to levels in control tissue or expected levels from pre tested standards. DNA and/or 
RNA may be quantified using techniques well known in the art. Messenger RNA is often 
quantitated by reference to internal control mRNA levels within the sample, often relative to 
15 housekeeping genes (Koehler et al., supra). 

hx a preferred embodiment hybridisation signals generated are measured by computer 
software analysis of hnages on phosphorimage screens exposed to radioactively labelled 
tissue RNA hybridised to a microarray of probes on a solid support such as a nylon 
membrane. In another, quantities are measured by densitometry measurements of radiation, 
20 sensitive film (e.g. X-ray film), or estimated by visual means, m another embodiment 
quantities are measured by use of fluoresce^ labelled probe, which may be a mixture of 
biopsy and normal RNA differentially labelled with different fluorophores, allowing 
quantities of dlg5 mRNA to be expressed as a ratio versus the normal level The solid suppon 
m this type of experiment is generally a glass microscope slide, and detection is by 
25 fluorescence microscopy and computer imaging. 

The detection of specific interactions may be performed by detecting the positions 
where the labelled target sequences are attached to the array. Relabelled probes can be * 
detected using conventional autoradiography techniques. Useof scanning autoradfography 
with a digitised scants and suitable software for analysing the results is preferred Where the 
30 label i, a fluorescent label, the apparatus described, e.g. in International Publication No WO 
90/15070; US Patent No. 5, 143,854 or US Patent No. 5,744*305 may be advantageously 
applied. Indeed, most array formats use fluorescent readouts to detect labelled capture* 
duplex formation. Laser confocal fluorescence microscopy is another technique routinely m 
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use (Kozal et aL, Nature Medicine 2:753-759 (1996)). Mass spectrometry may also be used 
to detect oligonucleotides bound to a DNA array (Little et al* Analytical Chemistry 69: 4540- 
4546, (1997)). Whatever the reporter system need, sophisticated gadgetry and software may 
be required in order to interpret large numbers of readouts into meaningful data (such as 
5 described, for example, hi US Patent No. 5,800,992 or International Publication No. WO 
90/04652). 

In a preferred embodiment, of the micro array test, the dlg5 SNA measurement is 
generated as a value relative to an internal standard (ie. a housekeeping gene) known to be 
constant or relatively constant The histone H3.3 and r ibosomal protein L19 housekeeping 



10 genes have been shown to be cell-cycle independent and constitutively expressed in alHissues 
(Kbehler et al.> supra). For normalisation of data, several different housekeeping genes can 
be used to generate an average housekeeping measurement. 

A microarray or RT-PCR test to detect IBD or susceptibility thereto can be used where 
tissue samples containing mRNA are available; 

1 S Samples for RNA extraction must be treated promptly to avoid RNA degradation 
(Sambrook et aL, supra). This entails either prompt extraction using e.g. phenol-based 
reagents or snap freezing in e.g. liquid nitrogen. Samples can be stored at -70°C or less until 
RNA can be extracted at a later date. Proprietary reagents axe availably which allow tissue or 
cells to be conveniently stored for several days at room temperature and up to several months 

20 at 4°C (e»g. RNAlater, Ambion Lie,, TX ). Prior to extraction, methods such as grinding, 
blending or homogenisation are used to dissipate the tissue in a suitable extraction buffer. 
Typical protocols then use solvent extraction and selective precipitation techniques, 
hi another embodiment oligonucleotide probe(s) capable of selectively hybridising to dlg5 
nucleic acid, can be used to detect levels of dlg5 gene expression. 

25 Convenient DNA sequences for use in the various aspects of the invention may be 
obtained using conventional molecular biology procedures, for example by probing a human 
genomic or cDNA library with one or more labelled oligonucleotide probes containing 10 or 
more contiguous nucleotides designed using the nucleotide sequences described here. 
Alternatively, pairs of oligonucleotides one of which is homologous to the sense strand and 
~ 30 one to the antisense strand,- designed using the nucleotide sequences described herein to flank 
a specific region of DNA may be used to amplify that DNA from a cDNA library. 

Levels of dlgSgene expression can also be detected by screening for levels of 
polypeptide (DLG5 protein). For example, monoclonal antibodies xmmunoreactive with 
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DLG5 protein can be used to screen a test sample. Such immunological assays can be done in 
my convenient format known in the art These include Western blots, immmiohistochcnrical 
assays and BLISA assays. Punctfonal assays can also be used, such as protein binding 
determinations. 

5 According to another aspect of the present invention, there is provided an allele specific 
primers or probes capable of detecting a polymorphism at position 16 in one or more of SBQ 
ID NOs: 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 
53, 55, 57, 59, 61, 63. 65. 67. 69. 71, 73, 75. 77. 79, 81, 83, 85, 87, 89, 91 and 93. The person 
of ordinary skill in the art willbe able to design suitable primers or probes using the sequence 
10 information provided herein. 

An allele specific primer is used, generally together with a constant primer, in an 
amplification reaction such as a PGR reaction, which provides me diacrimlnation between 
alleles through selective amplification of one allele at a particular sequence position e.g. as 
used for ARMS™ assays. The allele specific primer is preferably 17- 50 nucleotides, more 
15 preferably about 17-35 nucleotides, more preferably about 17-30 nucleotides. 

An allele specific primer preferably corresponds exactly with the allele to be detected 
but derivatives thereof are also contemplated wherein about 6-8 of the nucleotides at the 3> 
tenninua correspond with me allele to be detected and wherein up to 10, such as up to 8, 6, 4, 
2, or 1 of the remaining nucleotides may be varied without significantly affecting the 
20 properties of the primer. 

Preferred primers for amplification are between 15 and 60 bases, more preferably 
between 17 and 35bases in length. Probe sequences can be anything from about 25 
nucleotides in length upwards. If the target sequence is a gene of 2kb in size the probe, 
sequence can be the complete gene sequence complement and thus may also be 2kb in size. 
25 Preferably, the probe sequence is a genomic, or more preferably a cDNA, fragment of the 
target sequence and may be between 50 and 2000 bases, preferably between 200 and 750 
bases. It will be appreciated thatmultiple probes each capable of selectively hybridising to a 
different target sequence of the dlg5 nucleic acid, maybe across the complete length of the 
dlg5 gene sequence, may be prepared and used together in a diagnostic test The primers or 
30 probes may be completely homologous to the target sequence or may contain one or more 
mismatches to assist specificity in binding to the correct template sequence. Any sequence, 
which is capable of. selectively hybridiaing to the target sequence of interest, may be used as a 
suitable primer or probe sequence. It will also be appreciated that the probe or primer , 
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sequenees must hybridise to the target template nucleic add. If the target nucleic acid 1b 
donbje stranded feenomio or cDNA) then the probe or primer sequence can hybridise, to the 
sense or antisense strand. If however the target la mRNA (single stranded sense strand) the 
primer/probe sequence will have to be the antisense complement. 
5 An example of a suitable hybridisation solution when a nucleic acid is immobilised on a 
nylon membrane and the probe nucleic acid is greater than 500 bases or base pairs is: 6xSSC 
(saline sodium citrate), 0.5% SDS (Bodium dodecyl sulphate). lOOng/ml denatured, sonicated 
salmon spermDNA. The hybridisation being performed at 68°C for at least 1 hour and the 
-ffltga^Jgashed_at 6 8°C in 1 x SSC. o r for hiflher "^^y^pj ^ ssC/0. 1% SDS . 



10 An example of a suitable hybridisation solution when a nucleic acid is immobilised on a 
nylon membrane and the probe is an oligonucleotide of between 12 and 50 bases is: 3M 
trims thylammonium chloride (TMAC1), 0.01M sodium phosphate (pH 6.8), ImM EDTA (pH 
7.6) , 0.5% SDS,100M-gADl denatured, sonicated salmon spermDNA and 0.1 dried skimmed 
milk. The optimal hybridisation temperature (Tm) is usually chosen to be 5°C below the Ti 
15 of the hybrid chain. Ti is the irreversible melting temperature of the hybrid formed between 
the probe and its target. If mere are any mismatches between the probe and the target, the Tm 
will be lower. As a general guide, the recommended hybridisation temperature for 17-mers in 
3M TMAC1 is 48-50°C; for 19mers,it is 55-57°C; and for 20-mers, it is 58-66°C 

According to another aspect of the present invention there is provided an allele- specific 
20 oligonucleotide probe capable of detecting a polymorphism in human nucleic acid 

corresponding to that at position 16 of any ofSBQ ID NOs: 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 
25, 27, 29. 31, 33. 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73. 
75, 77, 79, 81, 83, 85, 87, 89, 91 and 93. 

The aflele-speoifie oligonucleotide probe is preferably 17- 50 nucleotides, more 
25 preferably about 17-35 nucleotide's, more preferably about 17-30 nucleotides. 

The design of such probes will be apparent to the molecular biologist of ordinary skill 
Such probes are of any convenient length such as up to 50 bases, up to 40 bases, more 
conveniently up w 30 bases in length, such as for example 8-25 or 8-15 bases in length. In 
general such probes will conmrise base sequences entirely complementary to the 
30 corresponding wild type" or variant locus in the gene. However, if required one or more " " " 
mismatches may be introduced, provided that the discrnmnatory power of the oligonucleotide 
probe is not unduly affected. The probes of the invention may carry one or more labels to 
fecilitate detection. The sequences disclosed as SBQ ID Nos: 6-93, when in single stranded 
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form, are representative examples of allele specific, probes capable of detecting one or other of 
the polymorphic variants ofiilgS. Bach of these sequences is folly complementary to the native 
dlgS gene and one or other of the particular allelic variants. 

Primers or probes for use in any of the methods of the invention may be manufactured 
5 using any convenient method of synthesis. Examples of such methods may be found in 
standard textbooks, for example "Protocols for Oligonucleotides and Analogues; Synthesis 
and Properties," Methods in Molecular Biology Series; Volume 20; Ed. Sudhir AgrawaL 
Humana ISBN; 0-89603-247-7 (1993); ^-Edition. If required the primer*® may be labelled 
to facilitate detection. 

10 There are many conventional detectable labels such as radioisotopes, fluorescent labels, 
chemilummescent compounds, labelled binding proteins, magnetic labels, spectroscopic 
markers and linked enzymes that might be used in conjunction with the primers or probes of 
me invention. One particular example well known in the art is end-labelling with 32 P. 
Fluorescent labels are preferred because they are less hazardous than radiokbels, they provide 
15 a strong signal with low background and various different fluorophors capable of absorbing 
light at different wavelengths and/or giving off different colour signals exist to enable 
comparative analysis in the same analysis. For example, fluorescein gives off a green colour, 
rhodamine gives off a red colour and both together give off a yellow colour. 

The oUgohucleotide primers and probes of the invention are particularly suitable for 
20 detecting the genotype of a particular SNP of dlg5. 

The DLG5 protein of the invention and homologues or fragments thereof may be used 
to generate substances which selectively bind to it and in so doing regulate the activity of the 
protein. Such substances include, for example, antibodies, and the invention extends in 
particular to an antibody which is capable of binding to the protein shown in SEQ ID No: 2. 
25 In particular the antibody may be a neutralising antibody. 

As used herein the term antibody is to be understood to mean a whole antibody or a 
fragment thereof, for example a F(ab)2, Fab, FV, VHorVK fragment, a single chain 
antibody, a multimeric monospecific antibody or fragment thereof, or abi- or multi-specific, 
aittibody or fragment thereof. Bach of these types of antibody derivative and their acronyms 
30 are well known to the person skilled in the art. 

In another preferred embodiment antibodies directed against DLG5 protein can be used, 
to detect, prognose, diagnose and stage IBD. Various histological staining methods known in 
the art. including unmunochemical staining methods, may also be used. Silver stain is but 
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one method of detecting DLGS proteins. Fob- other staining methods useful in the present 
invention see, for example, A Textbook of Histology, Eds. Bloom and Fawcett, W.B. 
Saundere Co., Philadelphia (1964). 

According to a further aspect of the invention there is provided use of an antibody 
5 selective for DLGS protein, in an assay to diagnose or prognose or monitor IBD. 

The antibodies for use in this aspect of the invention can be prepared using the DLG5 
protein/polypeptides. 

Methods of making and detecting labelled antibodies are well known (Campbell; 
Monoclonal Antibody Technology, in; Laboratory Techn iques in Biochemistry and Molecular 
10 Biology, Volume 13. Eds: Burdon RetaL Elsevier, Amsterdam (1984)). The term antibody 
includes both monoclonal antibodies, which are a substantially homogeneous population, and 
polyclonal antibodies which are heterogeneous populations. The term also includes inter alia, 
humanised and chimeric antibodies. Monoclonal antibodies to specific antigens may be 
obtained by methods known to those dolled in the art, such as from hybridoma cells, phage 
IS display libraries or other methods. Monoclonal antibodies may be inter alia, human, rat or 
mouse derived, For the production of human monoclonal antibodies, hybridoma cells may be 
prepared by fusing spleen cells from an immunised animal e.g. a mouse, with a tumour cell 
Appropriately secreting hybridoma cells may thereafter he selected (Koehler & Milstein, 
Nature 2S6;495-497 (1975); Cole et al, •Monoclonal antibodies and Cancer Therapy", Alan 
20 RLiss Lie, New York N.Y.pp 77-96 (1985)). Such antibodies may be of any 
immunoglobulin class including IgG, IgM,IgE, IgA, IgD and any subclass thereof. 
Polyclonal antibodies can be generated by inmannisation of an animal (such as a mouse, rat, 
goat, horse, Sheep etc) with an antigen, such as a DLGS polypeptide. 

The DLGS polypeptides) can he prepared by various techniques known to the person 
25 skilled in the art. RNA transcripts can be used to prepare a polypeptide of the invention by in 
vitro, translation techniques according to known methods (Sambrook et al supra). 

Alternatively, the DLG5 polypeptlde(s) can be synthesised chemically. For example, by 
the Merryfield technique (J. Amer. Chem. Soc. 85:2149-2154, (1968)). Numerous automated 
polypeptide synthesisers, such as Applied Biosystems 43 1A Peptide Synthesizer also now 
-30- exist,- Alternatively, and preferably, the DLGS polypeptide^) are-producedfrom amicleotide 
sequence encoding the polypeptide using recombinant expression technology. A variety of 
expression vector/host systems may be used to express the dlg5 coding sequences. These 
include, but are not limited to microorganisms such as bacteria expressed with plasmids, 
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cosmids or bacteriophage; yeasts transformed with expression vectors; insect call systems 
tranafected wi&baculo virus expression systems; plant cell systems transfected with plant 
virus expression systems, such as cauliflower mosaic virus; or mammalian cell systems (for 
example those transacted with adenoviral vectors); selection of the most appropriate system 

5 is a matter of choice, Preferably, the DLG5 protein is expressed in eukaryotte cells, 
especially mammalian, insect and yeast cells. Mammalian cells provide post-translational 
modifications to recombinant DLG5 protein, which include folding and/or phosphorylation. 

Expression vectors usually include an origin of replication, a promoter, a translation 
initiation site, optionally a signal peptide, a poly adeny lation site, and a transcription 

10 termination site. These vectors also usually contain one or more antibiotic resistance marker . 
gene(s) for selection, As noted above, suitable expression vectors may be plasmids, cosmids 
Of viruses such as phage or retroviruses. The coding sequence of the polypeptide is placed 
under the control of an appropriate promoter* control elements and transcription terminator so 
that the nucleic acid sequence encoding the polypeptide is transcribed into RNA in she host 

15 cell transformed or transfected by the expression vector construct The coding sequence may 
or may not contain a signal peptide or leader sequence for secretion of the polypeptide out of . 
the host cell Expression and purification of the DLG5 polypeptide® can be easily 
performed using methods well known in the art (for ©sample as described in Sairihrook et aL 
supra). 

20 The DLG5 polypeptide(B) so produced can then be used to inoculate animals, from 
which serum samples, containing the specific antibody against the introduced DLG5 
protein/polypeptide, can later b& obtained. 

Rodent antibodies may be humanised using recombinant DNA technology according to 
techniques known in the art. Alternatively, chimeric antibodies, single chain antibodies, Fab 

25 fragments may also he developed against the polypeptides of the invention (Hose et al, 

Science 256:1275-1281 (1989)), using skills known in the art Antibodies so produced have a 
number of uses, which will be evident to the molecular biologist or immunobgist skilled in 
tie art, Such uses include, but are not limited to, monitoring enzyme expression, 
development of assays to measure enzyme activity and use as a therapeutic agent. Enzyme 

30 linked immunosorbent assays (ELJSAs) are well known in the art and would be particularly 
suitable for detecting the DLG5 protein or polypeptide fragments thereof in a test sample. 

The DLG5 specific antibodies can be used in an EUSA assay to detect DL.G5 protein in 
body fluids or by inmnmohifitochemifitry or other means. In addition, an antibody could be 
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used individually or as part of a panel of antibodies, together with a control antibody, which 
reacts to a common protein, on a dipstick or similar diagnostic device. 

All the essential materials and reagents required for detecting DLG5 in a test sample 
may he assembled together in a kit. Such a kit may comprise one or more diagnostic cDNA 
5 probes or oligonucleotide primers together with control probes/primers. The kit may contain 
probes immobilised on a micro array substrate such as a filter membrane or silicon-based 
substrate. The kit may also comprise samples of total RNA derived from tissues of various 
physiological states, such as normal, BPH, confined tumour and metastatic tumour, for 

example, to be used as controls. The kit may also comprise appropriate packaging and 

10 instructions for use in the methods of the invention. ~ 
According to another aspect of the present invention there is provided a diagnostic kit 
for diagnosing or prognosing or monitoring ffiD comprising, one or more diagnostic probes) 
andfor diagnostic primers) and/or antibodies capable of selectively hybridising or bmding to 
DLG5. . 

15 It will be appreciated that the term "diagnostic kit" is not intended to limit the kit to 
diagnostic use only, it also encompasses other uses such as in prognostic, stage monitoring 
and therapeutic efficacy studies. 

In a preferred embodiment, the diagnostic (detection) probes are provided on a 
micro array. 

20 Such kits may further comprise appropriate buffers) and/or polymerase(s) such as 
thermostable polymerases, for example taq polymerase. They may also comprise 
companion/constant primers and/or control primers or probes. A companion/constant primer 
is one that is part of the pair of primers used to perfonnFOL Such primer usually 
complements the template Btrand precisely. The kit may also contain control normal RNA 
25 labelled with one fluorophore (B.g.. Cy5). muse, patient RNA derived ftombiopsy or body 
fluids or cells can be labelled with another fluorophore (e.g. Cy3), the RNAs could then be 
mixed and hybridised to the array. Instrumentation to detect fluorescence ratio e,g. of. 
Cy3:Cy5 are available and could be used to detect DLG5 over-expression. 

In another embodiment the kit comprises one or more specific probes suitable for 
30 hybridisation to mRNA ifi.tissue sections in situ. The kit may.also-contain hybridisation 
buffer and detection reagents for cotourimetrio or fluorescence microscopy detection. 
In another embodirnent me kit comprises a set of specific oligonucleotide prhnerB. optionally 
labelled, for quantitation by RT-PCR ofdUj5mRNA. These primers may he Scorpion primers 
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(Whitcombe et al, Nature Biotechnol, 17:804-807, 1999) allowing accurate quantitation of 
specific PGR product Alternatively, Taqman or Molecular Beacon ptobes may be provided 
in the kit for this purpose. One form of the kit would be a microtitre plate containing specific 
reagents in several wells, to which aliquots of extracted RNA could be pipetted. The 
5 microtitre plate could be thennocycled on a suitable machine, which could also be capable of 
reading fluorescence emissions from plate wells (e.g. Pexkin Elmer 7700). 

In another embodiment the kit comprised one or more antibodies specific for the DLG5 
protein for use in imnnmoMstochemical analysis. 

In another embodiment the kit is an ELISA kit comprising one or more antibodies 
10 specific for the DLG5 protein identified herein. 

In another aspect of the invention there is provided a method for treating a patient 
suffering from tBD comprising administering to said patient an effective amount of an 
antibody specific for DLGS. 

According to another aspect of the invention, the dlg5 gene may be used in gene 
15 therapy* for example where it is desired to modify the production of the protein in vivo, and 
the invention extends to such uses. 

Knowledge of the gene according to the invention also provides the ability to regulate 
its expression in vivo by for example the use of antdsense DNA or RNA. One therapeutic 
means of inhibiting or dampening the expression levels of a particular gene (for example dlg5 
20 identified herein) is to use antissnse therapy. AntisenB© therapy utilises antisense nucleic acid 
molecules that are synthetic segments of DNA or RNA ("oligonucleotides"), designed to 
miroor specific mRNA sequences and block protein production- Once formed, the mRNA 
binds to a ribosome, the cell's protein production "factory" which effectively reads the RNA 
sequence and manufactures the specific protein molecule dictated by the gene. If an antisense 
25 molecule is delivered to the cell (for example as native oligonucleotide or via a suitable 

antisense expression vector), it binds to the messenger RNA because its sequence is designed 
to be a complement of the target sequence of bases. Once the two strands bind, the niRNA 
can no longer dictate the manufacture of the encoded protein by the ribosome and is rapidly 
broken down by the cell's enzymes* thereby freeing the antisense oligonucleotide to seek and 
30 disable another identical messenger strand of mRNA, 

Tims, according to another aspect of the invention there is provided a method for 
treating a patient suffering fiomlBD comprising administering to said patient an effective 
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amount of an antisense molecule capable of binding to the mRNA of the dlg5 gene, and 
inhibiting expression of the protein product of the <Hg5 gene. 

Complete inhibition of protein production is not essential, indeed may be detrimental It 
is likely that inhibition to a state similar to that in normal tissues would be desired. 
5 This aspect of antisense therapy is particularly applicable if the IBD disorder is a direct 
causeof over-expression of the dlg5 gene in question, although it is equally applicable if said 
<Bg5 gene indirectly cause fee IBD disorder. Wife knowledge of fee dlg5 gene and mRNA 
sequence, fee person skilled in fee art is able to design suitable antisense nucleic acid 

therapeutic molecule s and administer them as required. 

10 Antisense oligonucleotide molecules wife therapeutic potential can be determined " — 

experimenting well established techniques. To enable methods of down-regulating 
expression of fee dlg5 gene of fee present invention m mammalian cells, en example antisense 
©spresston construct can be readily constructed for instance using fee pEEPIO vector 
(Invitrogen Corporation). Transcripts are expected to inhibit translation of the gene in cells 
IS transacted wife this type of construct. Antisense transcripts are effective for inhibiting 

translation of the native gene transcript, and capable of inducing the effects (e.g., regulation of 
tissue physiology) herein described. Oligonucleotides which are complementary to and 
hybridisable wife any portion of dlg5 gene mRNA are contemplated for therapeutic use. U.S. 
Patent No. 5,639,595, identification of NovelDrugs and Reagents", issued Jun. 17, 1997, 
20 wherein methods of identifying oligonucleotide sequences feat display in vivo activity are 
feoroughly described, is herein incorporated by reference. Expression vectors wntahiing 
random oligonucleotide sequences derived from the dlg5 gene sequence are transformed into 
cells. The cells are then assayed for a phenotype resulting fiom fee desired activity of the 
oligonucleotide. Once cells wife fee desired phenotype have been identified, the sequence of 
25 the oligonucleotide having the desired activity can be identified. Identification may be 

accomplished by recovering fee vector or by polymerase chain reaction (PCR) amplification ' 
and sequencing the region containing fee inserted nucleic acid material. Antisense molecules 
can be synthesised for antisense therapy. These antisense molecules may be DNA, stable 
derivatives of DNA such as phosphorofeioates ormefeylphosphonates, RNA, Btable 
30 derivatives of.RNA.such as 2'-0-alkymNA, or ofeeroligouudeotide mimetlcs. . U.S. latent . 
No. 5,652,355, 'Hybrid Oligonucleotide Phosphorofeioates", issued July 29, 1997, and U.S. 
Patent No. 5,652,356, 'Inverted Chimeric and Hybrid Oligonucleotides", issued My 29, 
1997, which describe fee synthesis and effect of physfologically-stable antisense molecules, 
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are incorporated by reference. Antisense molecules may be introduced into cells by 

nuoroinjecrioii, liposome encapsulation or by expression from vector 5 harboring the antisense 
sequence. 

As noted above, antisense nucleic add molecules may also be provided as RNAs, as 
5 some stable forms of RNA are now knov/n in the art with a long half-life that may be ' 
administered direcdy, without the use of a vector, m addition, DNA constructs may be. 
delivered to cells by liposomes, receptor mediated transaction and other methods known to 
the art 

The antisense DNA or SNA fcr co-operation with the gene in SBQ ID No: 1 can be 
10 produced using conventional means, by standard molecular biology and/or by chemical 
synthesis as described above. If desired, the antisense DNA or antisense RNA may be 
chemically modified so as to prevent degradation in vivo or to facilitate passage through a cell 
membrane and/or a substance capable of inactivating mRNA, for example ribozyme, may be 
linked thereto and the invention extends to such constructs. '■ 
15 ^^^DNAorantisenseRNAma 

disorders in humans in which the over- or under-regulated production of the dlg5 gene 
product has been implicated. 

Alternatively, ribozyme molecules may be designed to cleave and destroy the dlg5 
mRNAinvfw. Ribozymes are RNA molecules that possess highly specific endoribonuclease 
20 activity. Hamnierheadribozyines comprise a hyoridiakg regton. which is complementary in 
nucleotide sequence to at least part of the target RNA and a catalytic region, which is adapted 
to recognise and cleave the target RNA The hybridising region preferably contains at least 9 
nucleotides. The design, construction and use of such ribozymes is well known in the art and 
is more fully described in Haselhoff and Gerlaoh, (Nature. 334:585-591, 1988). In another 
25 alternative Oligonucleotides designed to hybridise to the 5'-regkm of the dlg5 gene so as to . 
form triple helix structures may be used to block or reduce transcription of the dlg5 gene, hi 
another alternative, RNA interference (RNAf) oligonucleotides or short (18-25bp) RNAi dlg5 
sequences doned into plasmid vectors are designed to introduce double stranded RNA into ' 
mammalian cells to inhibit and/or result .hi the degradation of dlg5 messenger RNA Dlg5 
30 RNAi molecules may begin adenine/adenine (AA) or at least (any base-A,U,C or G)A.. ,. and 
may comprise of 18 or 19 or 20 or 21 or 22 or 23, or 24 or 25 base pair double stranded RNA 
molecules with the preferred length being 21 base pairs and be specific to individual dlgS 
sequences with 2 nucleotide V overhangs or hairpin forming 45-SOmsr RNA molecules. The 



6. JUN. 2003 lb:b1 GLOBAL If PflltWIS BU . r , H| 

101073 

e-. 

-38- 

design, construction and use of such small inhibitory RNA molecules Is wall known in the art 
and is more fully described in me following: Rlbashir et al, (Nature. 411(6836):494-498, 
2001); BIbaehir et al, (Genes & Dev. 15:188-200, 2001); Harborth, J. et al. Q. Cell Science 
114:4557-4565, 2001); Masters et al (Froc. Natl Acad, Sci USA 98:8012-8017, 2001); and, 
5 Tuschlefaf., (Genes & Dev. 13:3191-3197, 1999). 

Pathway mapping may be used to determine each protein in the cell with which the 
DLG5 protein Interacts and, in turn, the proteins with which each of these proteins interacts 
also. In this way h 1b possible to identify the specific critical signaling pathway which links 
the disease stimulus to the cell's response thereb y enabling the identification of new potential 
10 targets for therapy intervention. " '. ~ 

According to a further aspect of the invention there is provided the use of the dlg5 gene 
or a fragment thereof in research to identify further gene targets implicated in ffib. 

In another aspect of the invention, the single nucleotide polymorphisms of this invention 
may he used as genetic markers for this region in linkage studies. 
15 Further features of the invention include: 

A method of treatment of a patient suffering from inflammatory bowel disease, 
comprising administration to the patient of a compound capable of reducing the transcription 
or activity of dlg5 gene products. 

A method of treatment of a patient suffering from BD, comprising administration to the 
20 patient ac inhibitory nucleic acid molecule targeted against the mRNA of dlg5. 

Use of an inhibitory nucleic acid molecule against dlgS or an antibody directed against 
DLG5 proteins, in the manufacture of a medicament for treating IBD. 

As used herein, me term '^inhibitory nucleic acid molecule" refers to molecules selected 
from the group consisting of: antisensB, ribozyme, triple helix aptemer and RNAi molecules. 
25 According to a further aspect of the invention there is provided a method of treating a 
human in need of treatment with a small molecule drug acting on me DLG5 protein or a drug 
comprising an inhibitory nucleic aoid molecule acting against d%5, in which the method 
comprises; 

i) detection of a polymorphism in the dlgS gene in the human, which diagnosis preferably 
-30- comprises determining the nucleotide present within human dlg5 gene that occurs. at position 
16 in the nucleic acid that corresponds to any of SBQ ID Nob: 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 
25, 27, 29, 31, 33. 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 
75, 77, 79, 81, 83, 85, 87, 89, 91 and 93; 
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10 determining the status of the human by reference to polymorphism fa the dlg5 gene; ^ 
iii) adminiBtering an effective amount of the drug. 

According to a further aspect of the invention there is provided a method of treating a 
human in need of treatment with a small molecule drug acting on the DLGS protein or a drug 
5 comprising an inhibitory nucleic acid molecule acting against the dlgS mRNA, in which the 
method conqxrifieB: 

0 measuring the level of the dlgS mRNA in a tissue sample obtained from the human and, 

h) deterrrnning the status of the human by reference to normallevels of the dlgS mRNA; and, 
Hi) administering an effective amount of die drug, 

10 According to a further aspect of the invention there is pro vided a method of treating a 
human in need of treatment with a Hmall molecule drug acting on the DLG5 protein or a drug 
comprising an inhibitory nucleic acid molecule acting against the dlgS mRNA in which the 
method coiqprises: * ■••..-'*..•. 

i) measuring the level of the DLGS protein in a sample obtained from the human and, 
IS u) deteimiiiing the status of the human by reference to normal levels of the DLG5 protein; 

and, 

iifl ad^rnnistering an effective amount of the drag. 

According to a further aspect of the invention there is provided a method of treating a 
human in need of treatment with a small molecule drug acting on the DLGS protein or a drug 
20 comprising an inhibitory nucleic acid molecule acting against the dlgS mRNA, in which the 



i) detection of a polymorphism in the DLG5 protein in the human, which diagnosis preferably 
comprises deterrnining the amino acid at any one of positions 30, 121, 514, 957, 979 and 1371 
Of the DLGS protein Sequence Shown in SBQ ID NO: 2; 

25 ii) oetermfning the status of the human by reference to polymorphism in the DLGS protein; 
and, 

hi) administering an effective amount of the drug. 

Amethod of treatmentof apatient suffering tromlBD, comprising administration to the 
patient of a compound papable of reducing the transcription or expression of dlgS. 
30 A method of treatment of a patient suffering from IBD, comprising adrninistration to the 
patient an inhibitory nucleic acid molecule targeted against the mRNA of dlg5, 

Use of an inhibitory nucleic acid molecule or an antibody directed against dlgS, in the 
manufacture of a medicament for treating IBD. 
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The invention will be further described by way of the following noa-limfctog examples: 
Example. 3 

Identification of dlffS as a eene as sociated with IBP. 

A genome-wide linkage scan involving 268 families (356 affected tabling pairs) of 
5 European descent was carried out to identify a susceptibility focus for IBD. Subsequently, a 
hierarchical linkage disequilibrium study was employed to search for the causal variant(s) 
within a broad 40cM pericentiomerio interval on chromosome 10, identified from an initial 
linkage scan This endeavour was started with a fine mapping experiment involving 523 

affected Bibling pairs and 16 microsateffite markers at an average distance of 2cM, The fine 

10 mapping experiment confirmed the initial linkage study, reveamigTnnkagBpeak at lQqT 



extending fromD10S201-DlQS192, with a maximum MLS score 1.6 at D10S2470. Linkage 
mapping was followed by transmission disequilibrium testing (TdT) using the algorithm 
implemented in the GENEHUNTER software package (Daly MJetal, Am I Hum Genet, 
Suppl 63:A286,1998) to further narrow the region containing the susceptibility gene. This • 
15 analysis tests for association of a given marker with the disease phenotype in the presence of 
linkage, and represents the most powerful and robust test for association, while omitting the 
risk of false positives due to stratification bias. The TdT performed on a single trio (one of the 
affected sibpairs and its parents) from each of the families Bhowed a significant single point 
association with the disease phenotype at D10S547 (p<0.001) and D10S201 (p<0.01). This 
20 led to the following strategy: the genomic region underlying the linkage peaks on 10pl4- 
10pl3 (extending from 8Mb- 13Mb) and 10q22-10q23 (77Mb-82Mb) were genotyped with 
107 (SNPs) in 200 German IBD families (from the original linkage set) plus an additional 555 
German Trios (368 CD, 1 87 UC), (a trio being a DNA sample from a IBD affected child plus 
DNA samples from its mother and father respectively) and 548 German control individuals 
25 (non IBD affected individuals). Several SNPs showed significant association with CD and 
IBD, respectively, by TdT in single trios extracted from the families and the trios, further 
confirming the presence of linkage disequilibrium with the susceptibility locus over the 
investigated region. It was therefore decided to perform additional association studies with 
SNPb at an average density of 75-120kB to further narrow the region harbouring the 
30. susceptibility gene. .Upon analysis of the high-density SNP panel, the lead region on lOq was 
scanned for nucleotide sequence variants as a highly significant single point association of the 
SNP marker TSC0376484 located at 78.5Mb to IBD was found with a x2=l 1.5, p=0.00067. 
TdT hapfotype analyses farther strengthened the association lead at TSC0376484 resulting in 
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ftignificanl 2-marker and 3-marker haptotypes with the neighbouring SNP markers 
TSC0005010 and TSC0000361. A maximum #=15.44, p=0.00008 was observed for the 2- 
roarker haplotype TSC0376484-TSCO000361 spanning a physical distance of approximately 
150kb. The positive SNP was located adjacent to the dlgS gene. To verify and former clarify 
5 the role ofdlgS in IBD. 6 gene-eased markers for dlgS were genotyped. The analysis of these 
markers also involving testing for haplotype blocks, as described by Daly et aL (Nature Genet. 
29:229-232, 2001), clearly showed that the association signal is entirely confined to the dlg5 
gene with a total of 17 markers with a positive association with IBD (Table 5), all of which 
are located on a common imderlying haplotype. Identical genetic association studies were 

10 also carried put on an adjacent gene. These proved negative, demonstrating that d%5 is the 
sole candidate for the susceptibility locus for IBD on 10q22. In depth re-sequencing for all 32 
exons and the exon-intron boundaries was performed in 47 individuals with proven diagnosis 
for IBD. Nucleotide sequencing was performed according to standard protocols, and the 
primers used are listed in Table 4. The DNA sequencing and analysis identified 20 novel 

15 nucleotide sequence variations (in addition to various publicly available SNPs) located to the, 
dlgS gene. 3 of which lead to an amino acid change of the protein, and further a 7fcp-deletion 
in the intron flanking exon 13. The genotype-related risk (GRR) (Risch&Merikangas, 
Science, 273(5281): 1516-7, 1996) is estimated to be 1.5 -2.5 based on the TOT resulte. 

20 Example 2 dhmfag and, AW* 1 ** aaa fafeia of dies 

A Eenomiq DNA ggnjfe of 41 7779 bp ™* flnny rocted bv asggrnfely of BAG elmum 
AL391421. AL4503nfi apd AT 321SS& . 

Using the database entry for dlg5 cDNA (BMBL accession number AF352034), all 
exons but the 5" UTR were mapped to the human genomic sequences covered by AL391421 
25 or AL4503306. The first 94 bases of AF352034 do not map to this region or any other region 
in the genomic contig around the gene. Since this sequence snowed similarities to multiple 
regions within the human genome it was considered as an artifact derived from repetitive 
sequence. A database BLAST search using sequence from exon 2 was used to identify two 
pig cDNAs containing novel 5' UTR sequence (BMBL accession numbers BM 484383 and 
30 BI402246). This sequence was found to match the human genomic contig and was also 

flanked by a conserved splice site on the 3> end and is therefore considered as the true 5' UTR 
exon of the human dlg5. Table 2 shows htton/exon border sequence mformatton of the 
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DLG5 gene, SBQ ID NO; 1 shows the sequence of dlgS including the novel 5' UTR sequence. 
SBQ ID NO: 2 shows the predicted amino acid sequence of dlg5. 

In depth re-sequencing for all 32 exona and th£> exon-intron boundaries was performed 
in 47 individuals with IBD. Sequences were aligned for SNP detection and in addition to 
5 detection of SNPs present in public databases, the analysis resulted in the identification of 20 
novel SNPs, four of which lead to $H amino acid change of the protein, as well as two novel 
genomic short deletions, one a 7bp-detetion in the intton flanking exon 13, potentially 
influencing the splicing of the gene. The genotype-related risk (GRR) is estimated to b& 1-5 - 
2.5 based on the TDT results. Table 3 lists the identified SNPs and adjacent sequence as well 



10 as the allelic versions of the SNPs. The Table alFcTinclucies posifionTof typed~SNPs*&oxn- 
public databases. Table 4 shows the primer sequences used to PGR amplify the SNP 
containing regions of dlgS. 



TaM*2. 



EXON 


SEQUENCE 


size (bp) I 


axon 01 


GAAGGCGCGGgtgag 


(> 166) 


exon 02 


cccagGTTCTACCTA. . . . 


. AGCAGTGTGGgtgag 


69 


exon 03 


cacagGC ACTACCGG . . . 


. . .TTGACAAGAGgtagt 


163 


exon 04 


gocagGCCCTACCAC . . . 


. . ACTTCTACCAgtgag 


144 


exon 05 


ttcagCACACTCC AC , , . . 


. .GCAGCAGCAGgtagg 


184 


exon 06 




. .CGCTGAGGAGgtagg 


260 


exon 07 


gccagGTCTGAGGCG. , . 


. . . GCTGCGGCAGgtagg 


313 


exon 08 


accagATCAAAGACA. . 


. ACAGCATCCGgtatg 


185 


exon 09 


accagGACACTGTGT. . . 


, . AGGAGCTCAAgtagg 


126 


exon 10 


caeagGGA ACAGATG. . 




133 


exon 11 


tgcagGAGGATATTG. . . 


. . . GCCGCTTA AGgtaag 


128 


exon 12 


ctcagGGTCAATGAC . . 


.. .GGACAGAAAGgtagc 


176 


exon 13 


tgcagACAGTGGCAT. . . 


.. .GATCGTTGCGgtaag 


104 


exon 14 . 


tctagATCAATGGCA. . . . 


. .CCTCCTGAAGgtaag 


93 


exon IS 


cotagGTATTCCCTC 


GTTCCTGGAGgtata 


1020 


exon 16 


tgcagGAACAGA AGT. . 


...CCGTCTGTGGgtgag 


124 


exon 17 




. .TGCATCCC AGgtatg 


145 
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exon 18 


cgcagTGTCCAGCAC. , 


. . . .GCCCGCCTGGgtaac 


113 


exon 19 


aa^gGTTClTCGAG.,. 


. . .TCCGAGAGAGgtaag 


90 


exon 20 


tgcagGTTCAGTGTC 


. . .GAAAGGACAGgtgag 


151 


exon 21 


cttagQCCTTATGTG 


• • .GTTACTGGAGgigag 


163 


exon 22 


cccagTTCAACGGCA 


.. . .CCCGGTOCAGgtgag 


134 


exon 23 


tccagCTCACACCTG... 


. . . GCAGCTCC AGgtcag 


141 


exon 24 


gaaagGATTGCGGGA. . 


. . . .CATCCTGGAGgtgag 


184 


exon 25 


ggcagTATGGCAGOC. . 




149 


exon 26 


tccagGGCCCTGTAC . . . 


. . . GC A AATATGTgtaag 


171 


exon 27 


cacagGATGGACCAA 


.... CTCTTTGAAGgcaag 


197 


exon 28 


ctcagATTCGGTGAG 


. . .TGTCCCCTTGgtaag 


144 


exon 29 


CgcagAGGTGATGAA, . 




123 


exon 30 


cccagAACCG ACACT. . 


. . ■ AGCAC ATCAAg tagg 


110 


exon 31 


cacagGGAGCAGAGA. 


TACTTCACAGgtagg 


110 


exon 32 ~ 


tgcagGQGTCATCCA 


>1749) 



Table 2 shows exon/intron barters for all exons of the oTg5 gene. The first and last 1 0 bp of 
each exon (capital letters) together with 5 bp of surrounding introns (lowercase letters) ^ 
indicated. Also, the total size of each exon is indicated. All sequences can be identified within 
5 human BAG clones with accession number AL391421 and AL45030©". 



Table 3 



FEATURE 


COMMENTS 


SEQUBNCE " 


SEQID 


tsc0000361, 
allele 1 


SNP, G to A, 
nitron 


gCtcggtggcagcgaGtgagaggagctcagt . 


6 


tsc0000361, 
allele 2 


gctcggtggcagcgaAtgagaggagctoagt 


7 


DLGe2, allele 1 


SNP, C to T, ~ 
5' of exon 2 


tggtcteccctctttGcocaggttctaccta 


r 8 


DLGe2, allele 2 


tggtctcccctctttTcccaggttotaccta 


9 
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TS1248655, 
aflfije 1 


SNP,CtoT, 
3'ofexnn2 


gtgggtgagtaccacCgtotggggaggacac 


10 


rsl248655, 
allele 2 


gtgggtgagtacracTfetctggggaggacac 


11 


rsl248696, 
allele 1 


SNP, A to G, 
within exon3 s 


ccctcctcactgaocAgcaagtgaatgagaa 


12 


1*1248696, 
allele 2 


ccctcctcactgaccGgeaagtgaatgagaa 


13 - 


DLGe3, allele 1 


SNP,GtoC, 


ccgcaagcgcctggcCtttgctacgcatggc 


14 




within exon 3, 
synonyjious 






DLGe3, allele 2 


ccgoaagcgcctggoGtttgctacgcatggc 


15 


M248695, allele 1 


SNP, CtoT, 
3' of exon 3 


ctgagtgtcccctttCccccacctcatgtcc 


16 


W1248695, allele 2 


ctgagtgtcccctitTccooacctcatgtcc 


17 


DLG©5A,aHele 1 


SNP.AtoG, 
within exon 5, 
non-synonymous 


ttcagcacactccaoAgccggctcctgagtg 


IS 


DLGe5A,aHete 2 


ttcagcacactccacGgccggctcotgagtg 


19 


DLGe5B, 
allele 1: 


SNP.-CtoA, 
3' of exon 5 


creagcccaggagaQggojatttctccea 


20 


DLGeSB, 
allele 2: 


cccagoccctggagaAi:ggccatttctccca 


21 


rsl248680, 
allele 1 


SNP,AtoC. 
intfoii 


ggcagccacccacctActC^gatccagccta 


22 


rsl248680, 
allele 2 


ggcagccaCCCacctCctcagatccagccta 


23 


DLGe7, j 
allele I 


SNP, Gto A, 
within exon. 7, 
synonymous - - 


gaga$£cacgcaggtGaagacagcaaaggag 


24 


DLGe7, 
allele 2 ' 


gagaaccacgcaggtAaagacagcaaaggag 


25 


ib1270912, 
allele 1 


SNP,TtaC, 


ctcagctgtggtggaTagactggacagtgcc 


26 
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NS1270912, 
allele 2 


intron 


ctcagctgtggtggaCagactggacagtgec 


1 27 - 


DLGelO, 
allele 1 


SNP,GtoC, 
within exon 10, 


I gaagttgtagagttcGagagggagacggtaa 


28 


DLGelO, 
allele 2 


non-synonymous 


1 gaagttgtagagtteCagagggagacggtaa 


29 


DLGel3A, 
allele 1 


SNP.CtoT, 
within exon 13, 


! ggagtgtatgctgcCgctgtgotgcptgga 


30 


DLGel3A, 
allele 2 


synonymous 


ggagtgtatgctgcTgctgtgctgcetgga 


31 


DLGel3B, 
allele 1 


deletion 

GCTGGAG 3' of 


gcggtaagtetcaagGCTGGAGccagggt 
catctgco 


32 


DLGel3B, 
allele 2 


exon 13 


gcggtaagtotcaagcoagggtcatctgce - 


"33 


DLG&14A, 
allele 1 


SNP, G to A 
5' of exon 14 


ggtaggcctgaggccGctctgccigtggcct 


34 


DLGeUA, 
allele 2 


ggtaggcctgaggecActctgcctgtggcct 


35 


rsl24S629, 
allele 1 


SNP, C to G, 
within exon 14, 
synonymous 


tgaatctctgctgcgCagctgccaggactec | 


36 


rsl248629, 
allele 2 


tgaatctotgctgcgGagctgccaggactec i 


37 


DLGel4B, 
allele 1 


SNP, G to A | 
3' of exon 14 


catgctactacttggGgtcacaggatccttg 


38 


DLGel4B, 
allele 2 




catgctactocttggAgtcacaggatccttg I 


39 


DLG&15A, 
allele 1 


SNP,CtoT, f 
within exon 15, 


cggggagoccatgcaCgcatoaccccctogo 


40 


UlAael5A, 

allele 2 


synonymous 


cggggagcccatgcaTgoatcaccccctcgo 


41 


DLGel5B, 
allele 1 


SNP, G to A 
within exon 15, 


acgcaecaccccctcGcaaggccagggtccg 


42 
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DLGelSB, 
allele 2 



DLGeldC, 
allele! 



DLGelSC, 
allele 2 



f s228930S, 
allele 1 



rs2289308, 



allele 2 



wl248634, 
allele 1 



ral248634, 
allele 2 



tsl24863S, 
allele 2 



rel248635, 
allele 4 



DLGel8, 
allele 1 



DLGelS, 
allele 2 



DLGel9, 
allele 1 



DLGel9, 
allele 2 



TB1248625, 
allele 1 



M1248625, 
allele 2 



DLGe23, 
allele 1 



ewbAL jr ffticwio 



-46- 



non- synonymous 



acgcatcaeccecteAoaaggccagggtccg 



SNP.CtoT, 
within cxon IS, 
non-synonymous 



SNP.CtoT, 
S' of exon 17 



SNP.CtoT, 
within exon 17, 
synonymous 



SNP.CtoT, 
3" of exon 17 



actcotcccacctgcCggccaagaaatcctg 



actcctccoaciptgcTggccaagaaatcctg 



:cc 



SNP, Gto C, 
3» of exon 18 



tacgcttctctgtacTccagctgcccaagco 



SNP.CtoT, 
wjthin exon 19, 
synonymous 



tcctgggactga] 



43 



44 



45 
46" 



47 



48 



49 



50 



51 



52 



S3 



SNP.GtoC, 
3' of exon 21 



ac 54 



gagtgtcgtgggctcTgagagaggtaagga 



55 



tcctagggaacagcaOtgctcocaagtoCCC 



56 



SNP.CtoT, 
within exon 23, 



57 



acacctggaccotgoCggtacccaetCCflct 58 
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DLGe23, 
allele 2 


synonymous 


acacctggaccotgcTggtaccoactccact 


159 


rs2289310, 
allele 1 


~ SNP,CtoA, 
within exon 23, 


ggcctagcaccccccCagccaagcagagcag 


|60 


1*2289310, 
allele 2 


non-synonymous 


ggcctagcacccccoAagccaagcagagcag 


[61 


rsl261990, 
allele 1 


SNP, T to A, 
5' of exon 25 


aggagggatgttgaaTttctgccgtatggtc 


62 H 


1*1261990, 
allele 2 




aggagggatgttgaaAttctgccgtalggtc 


63 


DLGe25A, 
1 allele 1 


SNP.CtoG, 
5' of exon 25 - 


gccgtatggtcagcaOggcccctctcgggt 


64 I 


DLGe25A, 
allele 2 




gocgtatggtcagcaGtggcccctctcgggt 


65 


DLGe25B, 
allele 1 


SNP,GtoA, 
5' of exon 25 


gc actggcccctctcGggtgcccaagctgcc 


66 j 


DLGe25B, 
allele 2 


gcactggcccctctoAggtgDccaagctgcc 


67 


rsl05B198, 
allele 1 


SNP.CtoT, 
within exon 26, 
synonymous 


gagctttaagaaggaCgacatcctt^acgtg , 


68 | 


131058198, 
allele 2 


gagctttaaganggaTgacatcctctacgtg 


69 


DLGe26, 
allele 1 


deletion of A, 


SgggtggggtggggcAggggtcgccgagggc 


70 


DLGe26, 
allele 2 


3' of exon 26 


ggggtggggtggggcggggtcgccgagggc 


71 


[rs2289311, 
allele 1 


SNP, T to C, 


agggcagcagggtctTgatggcoctgcccag 1 


72 ~1 


allele 2 


5' of exon 27 


agggcagcagggtctCgaiggccotgcccag 


73 } 


DLGe27, " ] 
allele 1 i 


5NP,CtoT, " 
within exon 27, 


agatgacaatagcgcCacaaagacgctgtca f 


74 j 
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DLGe27, 
allele 2 


synonymous 


agatgacaatagcgcTacaaagftogctgtca 


75 


rn2241831, 
allele 1 


SNP, C to A, 
5'ofexon29 


ggttactgacagotgCtgagcagtgttcuc 


76 


K2241831, 
allele 2 


ggttactgacagctgAtgagcagtgttcttc 


77 


rs2241833, 
allele 1 . 


SNP.CtoT, 
intron 


cctggatgectgggaCgacagacatgacaga 
cctggalgcctgggaTgacagacatgacaga 


78 
79 


allele 2 








fs2579150, 
allele 1 


SNP.CtoT, 
3'ofewm31 


ggctgttttcttagcCgtggagaagcccgcg 


80 


TS2579150, 
allele 2 


ggctgttttcttagcTgtggagaagccogcg 


81 


rtl0582Q2, 
allele 1 


SNP, Gio A, 
3'UTR 


gccgcctgaggggaoGccagactoagctott 


82 


relOS8202, 
allele 2 


gccgcctgaggggacAcoagactcagctctt 


83 


MLO58203, 
allele 1 


SNP.CtoT, 
3'UTR 


aagtagaagtctgtcCgtctatgaaoatgcg 




181058203, 
allele 2 


aagtagaagtotgtcTgtctatgaacatgcg 


85 


rs2165046, 
allele 1 


SNP, A to G, 
3'UTR 


tgtctatgaacatgcAggggaaggatccgga 


86 


n£16S046, 
allele 2 


tgtctatgaacatgcGggggaaggatccgga 


87 


rs2165047, 

aIIaIa 1 


oJXF, Ci to A, 
3'UTR 


ctctcctggaaggacGtcaeaacKJcaggtg 


88 


M2165047, 
allele 2 , 


ototootggaaggacAtcaeaactccaggtg 


89 


W2579151, 
allele 1 


SNP,CtoG, 


tctecagaagcttcaCtcacaciccactggt 


90 
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rs3579151, 
allele 2 


3 ! of gene 


tctccagaagcttcaGtcacactccactggt 


91 


tsc0376484, ~~ 
allele 1 


oiNr, (jf to A, 


agagttagacttcttGaacaaccttttaagg 


92 


tac0376484, 

allele 2 < 


3* of gene 


agagitagacttcttGaacaaccttttaagg 


93 



Table 3 identifies the SNPs and adjacent sequence as well as the particular allelic version. He 
table includes novel SNPs identified through mutation detection as well as SNPs ftompublio 
5 databases. All the public domain SNPs used have an rs- or tscnumber which win identify 
them uniquely in the genome. 



Table 4 



Name 


Sequence ' " ' 


aBQ ID NO; 


Dlg5exlF 


5'-CCATGACGGAGGTGGAAGC 


""97 ~" 


Dlg5exlR 


S'-AGAGGAGCGAGTCCACCGA 


98 


Dlg5ex2F 


5'-GACTGATGATCAGCTGGCTTG 


99 


DkjSex2R 


S'-CGGAAGGATGATCCTGTGAG 


96 


Dh|5ex3F 


5 , -CCAGGGGAGGATGCAA 


97 


Dlg5ex3R 


S'-ACAAGCACACCACTATCAGGG 


98 


Dlg5ex4F 


5-CCTAATCCAGGACCTGGTTC 


99 


DlgSex4R 


y-CTTGCACAGGGACAGGACTAG 


100 


Dlg5ex5F 


5 '- GCTGTATCTACGGGAAGTGTTG 


101 ~~ 


JDlgSexSR 


5'-GATCACAGATGTGAGCCAACG 


103 


Dlg5ex6F " 


5-ajrimX^TCAGTCTCACCCTC 


103 


Dlg5ex6R 


5'-GAGCCACGATTCCCAAGAGA 


104 


Dlg5ex7|? 


5'-ACATCTCGCCACCrC3TCTjTG 


105 


DlgSex7R 


5 '. tgctgtaggagaggctgaaa — : 


106 . 


Dlg5ex8F 


5'-TCAGCAACCTCrrCCCTCTTC 


T07 


Dfe5ex8R 


S-ACGCCAGCTTGAGGTCAC 


108 


D]g5ex9F 


5'-TGACCTTGTCCTCTGCTCCf 


T09 
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Dlg5ex9R 


5'-CATTGCCTTGCCCAQAAQ 


110 


Dlg5exlOF 


5"-(XGTGGCTCTCTCTGTrCAC 


111 


DlgSexlOR 


5-TCACCGTCrCCTCCTTCATC 


112 


Dl&SexllF 


5-TTTGTGAAATC5GTTGCTGT 


113 


DlgSexllR 


5'-GTGCTACXJrGGCTCTCTTCG 


114 


Dlg5exl2F 


5'-CCCX3TGAGTGAGAGTTGTGG 


115 


DlgSexl2R 


5'-CCACTGAGGTrGATGTGCAG 


116 


Dlg5exl3F 


5-CACCAGGGTAGATGTAACTGAG 


117 


Dlg5exl3R 


5'-GATTOTATTTCCCTCCCAGAC 


118 


Dlg5exl4F 


5'-CTCCTGACTTTGGCACCTTG 


T19 


DlgSexl4R 


5'-TTCAGACCAGCGTCCAGTCA 


120 1 


Dlg5exl5aF 


5'-TGCTTTACCTCTGGGGATGG j 


121 


DlgSexlSbF 


5*-ClTCX2GK?rCAGATGCCTCTG 


122 


DlgSexlScF 


5'-CAGAAGGAGCGACTCCATTAAG 


123 


Dlg5exl5aR 


5-GCAAAGGCACCAGGCTAAAC 


124 


Dlg5exl5bR 


5-GGTAGTAGCTGGAAGCAATGC 


125 


Dlg5exl5cR 


S'-CAGAGAGCTTCTCAGGCACTG 


126 


D1g5exl6F 


S'-TGGCCACACTCCACTCTTTC 


127 


Dlg5exl6R 


5-CTCAGGGCTGAAAACACATG 


128 


DlgSexl7F 


5'-GAGCCACAGCCACATTGTGA 


129 


DlgSesxHR 


y-GGAAGCTTCICCAGCAATGA 


130 


Dlg5exl8F 


5 , -GAACCXTrGCCTGGTCTGTG 


131 


Dlg5exl8R 


S'-GAAAGCAATGGCTCTGACAG 


132 


Dlg5exl9F 


S'-GACTGGTAGCCTGGTGGAGA 


133 


Dig5exl9R 


5'-GAAGTTCTCAGCTAAGCCCAG 


134 


Dlg5ex20F 


S'-GCAATGCAGAGCCTAGCATC 


135 


Dlg5ex20R 


5'-TGCTGGGACACTCAAGCTAC 


136 


Dlg5ex21F 


5'-CTGCACTGTCAGATCATATGC 


137 


Dlg5ex21R 


5 , -ACACCAGGATGGGCTCAGTG 


138 


Dlg5ex22F 


S'-GAACAGCAGTGCTCCCAAGT 


139 


DlgSex22R 


S'-CCAGAACTTACGGCTGGCAC 


140 


Dlg5ex23F 


5'^CTCAGATCTAGTTGCCACAGG 


141 
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Dlg5ex23R 


S'-CCACTTGGAGAATGTGCTCAG 


142 


Dlg5ex24F 


y-CCAAGAAGCAGGCAGAAAGC 


143 


Dlg5ex24R 


5'-TGTACTCCTCCGTCTTTGGTG 


144 


Dlg5ex25F 


S'-GACTCAGTCCTTCCTGCAGAG 


145 


D3g5ex25R 


5'-CAO:AGGAAAAGAGTCTCCAG 


146 


Dlg5ex26F 


S'-CTCGGCGATTCCTGATCAAG • 


147 


Dlg5ex26R 


5-GAAGCAGAATCCCTCCTCCAG 


148 


DJg5ex27F 


S'-GTAGCCITGAGACCTGCCAAG 


149 


DJg5ex27R 


S'-TGTGGCTCTGAAGATGGCAG 


ISO 


Dlg5ex28F 


S'-GTGCTCATGCTGGACTCCAG 


151 


Dlg5ex28R 


5-CAGGCTTCTGGAACACTGTG 


152 


Dlg5ex29F 


5 '- GTC AGATTCATGC ATGGCAG 


153 


Dlg5ex29R 


5-CAGGCACAGGTGAACTCAGAC 


154 


Dlg5ex30F 


5 '-CTGTGTGGCTTTACTGCCTTG 


155 


Dlg5ex30R 


5'^ATAGGCCCATCTCTCATTC 


156 


DlfiSex31F 


j -uL l u ITGCTGTGC1TTATGTG 


157 


DlgSex31R 


5-AGAATCCTGACGTTGGCCAG 


158 


Dlg5ex32F 


S'-CTGGTGAAGGAGAGTCAGGTG 


159 


Dlg5ex32R 


5'-GTGCTTCTGGGTCCTGGTTC 


160 



Table 4. Primers used for Nation detection of the cOgS gen* Each primer is named after 
exon number and either F for forward or R for reverse primer, 



5 TableS 



SNP 


IBDfami] 


ies and trios 






IBD 

trios 








T:U 


T/U 


ch!2 


p-value 


T:U 


T/U 


chi 2 


p-value 


TSC0376484 


272:198 


1.37 


11.65 


p=0.0006 


206:150 


1.37 


8.81 


p=0.0029 


rs2579151 


275:225 


1.20 


5,00 


p=0.02 


212:170 


1.24 


4.62 


p=0.03 


ra2 165047 


283:232 


1.21 


5.05 


p=0.02 


214:168 


1.27 


5.54 


p=0.018 


rs2165046 


369:296 


1.25 


8.01 


p=0.004 


277:212 


1.30 


8.64 


p=0.003 


rsl058203 


286:244 


1.17 




D.8. 


221:181 


1.22 


3.98 


p=0.04 
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220:178 


1.23 


4.43 


p:=O.03 


rs2241833 






4.77 


p=u.G2 


219:172 


1.27 


5.65 


p=0.0l7 


f 522893 11 








p=0.002 


267^204 


1.30 


8.43 


p=0.003 


rsl058198 








p=U.u(J17 


262:193 


1,35 


10.46 


psO.OOl 


rel2619M 






4.43 


p=0.03 


221:174 


1.2S 


5.59 


p=0.01 


DLGelS 




1 OB 


y./ / 


5=0.002 


272:206 


1.32 


9.11 


p=0.003 






I. it) 


8.53 


p=0.003 


265:205 


1.29 


7.66 


p=0.005 


rsl248634 


287:235 


1,22 


5.18 


3=0.02 






j.ij 


p=l),02 


rsi248680— 


^284:244. 






ELB» 


217:177 


1.22 


4.06 


p=0.04 


rel24869d 


134:105 


1.27 


3.S2 


p=0.06 


107:84 


1.27 


2.9 


p=H).09 


DLGe3 


128:44 


2.90 


41.02 


p=0.000000 


102:32 


310 


34.00 


p=0.000000 


TSC0068S13 


327281 


L16 


as. 




257:203 


1.26 


6.34 


p=0.011 1 



Table 5. Data demonstrating the association between IBD and dlg5. 



5 BSBBSilgS analyfliapf DLG5 on a p an el of normal tWa jmd colon SiflBgjgg p^u— 
With IBD u sing real j imy. pPR 
Real time Pn? 

Real-time PCR-experimentS where perfonned on Applied Biosystems 7900 HT in 384 format 
with Syber Green chemistry (double-stranded DNA binding dye, minor groove binding) and 

10 fluorescent probes. In order to be able to detect any unspecific amplification and melting, 
curve analysis was performed after each completed PCR. 

Bach sample was run in duplicate with 4ng of template in each reaction (lOjxl). The 
PCR reactions were run at 50 cycles for both sample and reverse transcriptase negative 
controls. Non-template controls where also studied to confirm that the signals were not due to 

15 the primers themse^es. The prhners where designed firorncDNA sequence in the program 
Primer Express, manufactured by Applied biosystems. The primers were all complementary 

. to themtron exonj^crion Aa internal standard h36b4 (acidic ribosomal phosphoprotem P0) 
and p-actin were used to normaliae for differences"ihTUNTA input and cDNA synthesis— 
The relative expression in the different samples was calculated by the parameter C T (threshold 

20 cycle). Oris defined as the fractional cycle number at which the fluorescence passes a 
threshold above baseline. 
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" **"* ** (<3r tiLai "Or 36B4) alrenuttiveb, CCDLG5-& p-acdn) 
Or DL05 = the threshold cycte fa te ^ ^ ^ ^ ^ 

Or 36B4 s the threshold cycle for 35b4 in the same tissue. 
5 Orp-wtto the threshold cycle for p-aatata the santeUssue 

The primer, used forDI/35 (Cxoo27-«oo2 8 ) odeored in 5- to 3- direction were 
forward: GCAGTGACTCCATTCCACTCTTT (SEQffiNO: 161) audreve*, 
^G^GTCCACCTTCT^SEQtDKO: .62). Ateo prtaer. were used for eM o 29-30 
forward: GGAGAAGCGQCCATTTCG (SEQ ID NO: 163) and reverse' 
10 CrCAATAGCGTGCOOAGCAACSEQ ID NO: 164). eson 32 forwtu* 
CAGTOTGGGTGTCTTCGTTIT3G (SEQ ID NO: 165) and reverse: 

ATTAACAGGACGCATAGCTTAAGGA (SBQIDNO: W»-.-N« rffc „ B , 
I™*rs exos.29-30 andeson 32, fluoreaeen. pruhe, were used for deletion use* 

-™«o n29 .30: AAAGGAQATCACAGAAAAGAAOCOACACTGC (SEQ IDNO- 
IS 167) ™^ ^'^^^*GATATGTCTCK3KrraAAGA' T TTGATGTG (SEQ ID NO- 

^l^ecot, of anpBfed fre^ for e**, 27-28 was done usins Syber gLn ' 

Foreadogenocscomrolgeeeescchas acidic «b M o m a 1 p to sphop K)tefa p 0 <h36M) a M 
20 N^TT *—* OCArrCrATCATCAACQOGTACAA (SEQ ID 

7 1 ^^ O ^^GTAAlXX( S BQl DN O,n0)forI ) 36M 
andforwerd: AGOCTCGCCnTGCCGA (SEQ ID NO; 171) and reverse- 

^GGTGCCTGGGGCO (SEQ ID NO: 172) for p-actat Rjr P-actin, detection with a 

^T^ZT^^ ^<*™CACACeCGCC (SEQ ID 
NO. 173), while Syber Green ChereisBy was used for detection of h36b4. 
25 Tismenrntf 

_^>»«M,» P *- taa ».I* 11BWllhlh 

used: heart, pfecen*. liver, skeletal remote, kidney, pancreas as part of 
^ MTCTIM , mi . Teajj , ^ ^ ^ M ^ 

^ 1 ™ ,m!MrcTm n ^ I « S -^"«NA (HU n B n b ra i n,.hote 
fc«T«« WtoHH* wasesedforfoecDNA synthesis. Hvetandrednaungrereof**, 
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RNA was used for each reaction using the oligo dT primer provided to the kit for the reverse 
transcriptase <RT) reaction and RT negative controls. 
Colon Biopsies 

Colon biopsies were taken from patients using endoscope and a grasp biopsy tooL 
5 BiopBies were taken from the mucosa at different parts of the colon and terminal ileum. 



Tissue distribution of DLG confirmed high expression in placenta and testis and also 
high expression, hi brain, prostate, descending colon and fleam as shown to Table 6. 
J Analysis of colon bjopstes showed amarkedly decreased expression to Crohn's (CD) patients 



10 compared to non-JBD disease controETAffiw^ 

samples consisting of 6 non-ffiD controls and 29 CD cases shown to Table 7. 
A more modest decrease was seen to an another subset consisting of 16 hospitalised 
normals(HN)<appro^ 1.5-fold) and 39 non-lBD controls (DQ(approx. 2-fold) compared to 23 
healthy controls as shown to Table 8. Whereas a consistent difference was seen to a third and 
15 a fourth subset of 40 ulcerative colitis (UQ and 49 CD samples compared to 25 healthy 
normals. The results are shown to Table 9 and 10, suggesting a 1.5- and 2.3-fold decrease 
respectively. 
In gurmnaiyj 

We have found DLG5 to be expressed to most tissues examined: placenta, heart, 
20 prostate, skeletal muscle, liver, pancreas, kidney, brain, colon, testis, ileum and small 

intestine. The highest amount of expression was detected to placenta and ileum. We have also 
identified a significant difference of expression between colon biopsies fromlBD patients 
relative to non IBD controls and healthy controls. 





2 -uw 


Tissue 


WOO 


Prostate 


7.21 


Placenta 


31.1 


Heart 


1.7 


Skeletal muscle 


1.71 


Liver* " • 


0,42 


Pancreas 


1.44 


Kidney 


2.39 


Brain 


19.86 


Descending Colon 


5.28 
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Testis 


6.91 


Ileum 


7.9 


Small intestine 


2.26 



1 55 ■ 



Table 6. Relative expression of DLG5 in selected human 



Exon 27-26 


DC(n=S) 


CD(n=29T 


Mean 


23.88 


4.08 


Median 
Pvalue 




' 4.043 



Table 7. Relative expression analysis using real time PCR according to formula 2* a *1000 
5 using h36b4 as endogenous control. Colon biopsies were taken fromnon IBD disease controls 
Cu=6) and Crohn's disease patients (n=29). P-values are emulated using Student's T-test 
assuming non equal variance. 



Exon 29-30 






Mean 
Median- 
Pvalue 


Normal tp=Z3\ 
2.778 
2.800 


1.939 
1.980 
4.364E-04 


DO (n=^S> 
1.476 
1.300 

5.303E-09 










Exon 32*"" 


Normal (n=2S) 


HNjn=16) 


DC (n=39) 




Mean 
Median 
Pvalue 


0.954 
0.866 


0.630 
0.485 
3.24E-02 


0.420 
0.251 
1.63E-05 





10 Table 8. Relative expression analysis using real time PGR according to formula 2** *10Q0 
for colon biopsies taken from healthy normals, HN = Hospitalised normals (n=16), DC = 
Non-IBD disease controls fe=40). P-values are calculated using Student's T-test assuming 
non equal variance. Primers against bom exon 29-30 and exon 32 were used showing 
reproducible results, 



15 





Normal /n=2si 




Mean | 2.382 
Median 8 2.408 
Pvalue J 4.31 r-nr 


1.444 
1.355 


Exon 32 






Mean 
Median 
Pvalue 


1.00 
0.88 
1.10E-06 


0.4S ~ 
0.43 



6. JUN. 2003 15:56 GLOBAL IP PATENTS 
101073 



NO. 3779 P. 



56. 



Table 9. Relative expression analysis using real time PCR according to formula 2' ac * *1000 
ibr colon biopsies taken from healthy normals and Ulcerative colitis (UC) patients. P-vahies 
are calculated using Student's T-test assuming non equal variance. Primers against both exon 
5 29-30 and exon 32 were used showing reproducible results. 



Exon 32 


Normal (n=25) 






Mean 
Median 
P value 


0.927 
0,881 
1.003E-04 


0.570 
0.517 


Exon 29-30 


Normal (n=24i 


CD (n=49J 




Mean 
Median 
. P value 


2.508 
2.170 
2.07E-04 


1.588 
1.4B0 



Table 10. Relative expression analysis using real time FCR according to fbrmnla 2 ,fla *1000 
for colon biopsies taken from health normals and Crohn's disease (CD) patients. P-values are 
10 calculated using Student's T-test assuming non equal variance. Primers against both exon 29- 
30 and exon 32 were used showing reproducible results. 
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1. A method for identifying a compound capable of modulating the action of the DLG5 
protein which method comprises subjecting one or more test compounds to a screen ' 
comprising a polypeptide comaining the amino acid sequence shown in SBQ ID NO: 2, or a 
homologue thereof or a fragment of either. 
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Sequence listing: 

SEQ ID NO: 1 shows the 7269. base pairs nucleotide sequence of dlgS cDNA. Predicted start 
and stop codons are underlined. Hie first 166 nucleotides are derived from the novel 5' UTR 
exon. 

5 SEQ ID NO: 2 shows the predicted DLGS peptide sequence of 1809 amino adds. 
SEQ ID NO: 3 shows the sequence of the novel 5* UTR identified for dJg5. 
SEQ ID NO: 4 shows the proposed 5' UTR sequence from EMBL entry AF352034. 

SEQJDNp:! 

k)— gaggegggaggggg^aaggggxsaggtgot 

cgggaccacttgcaggacctgcg<3gcggcgctggagaagacgcagccrcaccrm 
ctgcccattctctacctgaacggcgtcgtcgggccgccgcagcccgccgaaggcg 
cgggttctaktacagcgtcctgtcx:accaj^cctcagactcagaaagcagcag 
ctoxtcagicagtgtgggcactaccgggaaggcgrc 
15 actgaccagcaagtgaatgagaaggtggagaacctcto:attcagctgcggcrg 
atgacccgggagagaaacgagctccgcaagcg<xrrggccrrrgctacgcatggc 

ACGG^CTTTGACAAGAGGCCCTACCACAGGCTGAATCCTGACTATGAGAGGCTGA 

AGATCCAGTGCGTGCGAGCCATGTGGGACCTGCAGAGCCTGCAGAACCAGCACA 

CCAACGCX?ITGAAGAGGTGTGAGGAGGTGGC^ 

20 CACTCCACAGCCGGCTCCTGAGTGACCAGACTCGGCTGAAGGATGACGTGGACAT 
GOTGAGGCGGGAGAATGGGICAGCTGCTGCGGGAGCGAAACCTGKnXj 
ATGGGAGGACATGAAGCGGCTCCACGAGGAGGACCAGAAGGAGATCGGTGACCT 
CCGTGCCCAGCAGCAGCAGGTGTTGAAGCACAACGGGTCATCCGAGATTCTCAAC 
AAACTGTATGACACGGCCATGGACAAGTTGGAGGTGGTCAAGAAGGACTATGAC 

25 GCO^CGGAAGAGGTA(^GTGAGAAAGTCGGCATCCACAATGCAGACCTGAGC 
CGCCTGGAGCAGCTGGGGGAGGAGAACCAGCGGTTGCTGAAGCAGACAGAGATG 
CTGACCCAGCAGAGGGACACGGCCATCCAGCTGCAGCA(X:AGTC 
TGAGGAGGTTTGAGGCGATCCACCATGAGCTGAACAAGGCCACGGCGCAGAACA 
AGGACCTGCAGTGGGAGATGGAGCTCCrGCAGTCAGAGCTGACCGAGCTGAGAA 

30 GCACGCAGGTGAAGACAGCAAAGGAGTCGGAGAAATACAGGGAGGAGCGGGAC 
GCTGTGTACAGCOAGTACAAGCT^ 

AGCTGGACAAGCTGCAGACCGAAGTGGAGCTGGCCGAGTCCAAGCTCAAGAGCA 
GCACATCTGAGAAGAAGGCGGCCAATGAGGAGATGGAGGCGCTGCGGCAGATCA 
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AAGACACX3GTGACAATGGATGCTQ(K3AGAGCCAACAAGQAGOTTGAAATCCrTC 
QAAA0CAOTOCAAQQCICIX3TQCCAGQAQCraAAGQAAGCCCTCCAGGAQOCQO 
ATOTQGCCAAGTGCCOGCGGOACTGGGCCTTCCAQQAaCGAQACAAGATTGTAG 
CAGAGCGTGACAGCATCCGGACACTGTGTGACAACCJTGAGGCGGGAGCGGGACC 

AGAAGAATGATGTCAGCCGCGAGCTGAAGGAGCTCAAGGAACAGATGGAATOCC 

AGTTGGAAAAGGAGGCXXGGTrCCGACACTCTGATGGGQI^CAGCTCCCACGACT 

OGGCCATrGACACGGATTCCATGGAGTGGGAAACGGAAGTrGTAGAGTTCGAGA 

10 TCAATGAGCCTTGTTTXXCGGGGGACTGTGGCATATTTQTCACTAAAGTGGACAA 
AGGAAGCATTGCIt^TGGCCGCTrAAGQGTCAATGACTGGCTCCT'GAGAATCAAC 
GATGTGGACCTCATCAACAAC5GACAAGAAGCAGGCCATCAAGGCGCTCCTCAAT 
GGGGAGGGGGCCATCAACATGGTCGTGCGGCGGAGGAAGTCCCTGGGTGGGAAG 
GTaGT^CGCCGCIXKTACATGAACCTCAGTGGACAGAAAGACAGTGGCATCAGT 
^r AA ^^ A ^ G ^ GT ^ C ^ AA ^«^GCTAAAG 

^I^^^ AGACAQQA ™^^ 

CAACAAGTCTCTGAATGAATOTGAATCTClt3CTGCGGAGCTG0CAGGACrC0CrG 
ACCCTX3T0CCTCCTGAAGGTATICCCTCAGAGCTCC^X^TGGAGTGG0C^ 

20 GGAGGTCCAGGCTCATAACAAACGGAACITGATACAGCACAATAACTCCA^^A 
GACAGACATCTICTACACGGACAGGCn^3GAAGACAGGAAGGAGOCAGGCCCOCC 

T^CCXTCAGGCCTGTCCCAGTGCCTCTGAGCX^AGCCTGAGCrCCTICCGCTCAG 

AA^CAGAC^GAA^CCAACAOCGGCGGGACX.GGCCCA^^ 
^^^^ 

ACIACClGClTC^GGTIXrGGGCCraCrWCICT^ 
" ^^-^CAAAACCnXCAGAAGGAGCGACT^AAG^rG 

^! A r^^ QACTAa ^ A6 ^ QAAGmAc ^^^<Kn^T 

OCACTAGKrCC X «AG C<3 CCCrGaXrC TO ACGTG^CCCC C K 3 GGAG^^ 



CGCATCACOCCCTCGCAAGGCCAOGG' 



TCCGCATTGCITCCAGCTACTACOCTGAA 
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GGAGATGGGGACTCCTCCGACCTGCCGGCCAAGAA^ 
CCTCCXIAlGAAGGTGGATGAGCTGQGGCAQAAQCGTCGCCGGCQAAAATCT 
CCAGTTrrcGGCCGAAGCTKK^TCX^AGTAGTGATr 

ACAGAAGTGTGTCCCGGCCAGTGGAGAACTCTCCCCGGAGCTCCAGGAGTGGGC 
6 ACCTTACTCGCCTGGGCATTCCAGCCGGCACAGCAACCCCCCGCTATACCCTAGC 
AGGCCGTCTGTGGGCACTGTTCCCXGGAGTTTGACC^ 

CCATCCTGCGGAACCCCATCTACACTGTGCH3CAGTCACAGGGTCGGCCCCTGCAG 
CTCTCCL\CCTGCGGCCCGAGATGCTGGCCCCCA.GGGTTTGCA 
CACCAGGGACGCCTGAGCCTGGACCTGAGCCACAGGACCTGCAGCGACTACTCC 
ro~GAGATGAGAGGGA€eGATOGQT(^AAG3rGACnX3CCC^ 

cttcgagtaacttgcagttcaaggcggaacgcat^^ 
atatccgcggagtgtcgtgggctccGagagaggttcagtgtcacattctgaatgc 
agcactcctccacagtcacccctgaacatcgacaccctgtc . 
ccagacctcagcctccacattgcccagaatcgctqtgaaccccgcgtccctcggg 

1 s gagcggagaaaggacaggccttatgtggaggagccacgccacgtgaaggtgcag 
aagggctcagagccgctgggcatctccatc^tgagtgc^ 
tacgtctccaaggtgaccgtggggagcatcgctcaccaggctggcctcgagtatg 
gggatcagttactggagttcaacggcataaacctgcggagcgccacggagcagc 
aggcgcggctcatcatcgggicagc^gtgtga^ 

20 caacccccacgtgcaccagctcagcagccactcccggtccagctcacaccxggac 
cctgccggtacc^crrccactctccagggcagtggcaccaccaccccggagca'rg 
catctgtcatcgacccactgatggagcaggacgaggggcctagcacccccccagc 
caagcagagcagctocaggattgcgggagatgccaacaagaagaccctggagcc 
acgcgttgtgttcatcaaaaagtcccagctggagctrggggtgcacttgtgtggt 

25 gggaacctgcatggggtgtttgtggccgaggtggaggatgacagtcctgccaag 
ggtcctgacggcctcgtgcgaggggacctcatcctggagtatggcagcctggacg 
tgcggaacaagacagtggaggaagtctatgtggagatgctgaagcccagggatg 
gcgtccgcctgaaggtccagtaccgccctgaggagttcacgaaggccaagggcc 
tgcctggtgacagcjttcttacatcagggccctgtacga 

30 gcaagagrtgagctitaagaaggacgacatcctctacgtggatgacaccttaccc 
cagggcacgttcgggtcctggatgg^^ - 
atc cag cgcgg gcagattcccagc aaatatgtg atgg acc aagaattctccagg 
aggctcagcatgtctgaagtcaaagatgacaatagcgcx:acaaagacgcrgtca 
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cac^Z^ 00 ^^ 

T^T^ OA1TC ^ AtK ^^ A ^<^reCAGAAGGTCGAC TO 

CACAO AAAAQAACCQACACTGCXTTCXITOGACATrQCrCCGCACOCrATroAGOGQ 
CTCCACCACATCCACATCTACCCCATTG^^CATOACTAC^alS^! 
10 ^^^AA^AOCAaAOA^CCCA^CC^aGOA^^^ 

QQCATrcCAAAQAGCAQriTGAOaCGacQCAOAAQCTrGAQCAO^QT^AGC^ 

CTCJ^QGAGAATGCTGTQCraTCOAlXMCTQCAQCTGOCCQrc^QAG^MQAC^^ 

CGGGGGAAGGATCCGGAACCAGGACCXlAGAAGCACCTCCTTrGTAGACAGAGf^ 

^CirrAACACAAAAOAGATAACACTAAAGACG^T^GCA^!^^ 

20 GCCTTTIX^AAGTATITGGTOGTrrATOTITACrrGAACGGCrcCATGTrGCCGOT' 
. ^^C^r^^ TOTCGC ™™^S 
CGTC7TX!^GCAAAACGAOCTTGGAACCTCAATGK3GGGCTGCTrrGCrTTGGGAGG 
^^^^^^^^OGGACCAGAGCTTrGACAAACCTCCnSCICCTTGGTGGC^^^^T 

C^^f^^^ T ^ AAMTCA ™«TI OT ACAAA 

TOCTTATTACIT^AGGACIGTGTCTGCrcCGTGAGAQCIGCGTG^^C^^GGC^^^ 

GGCTCTGTAAGTCGTGACAGCCTrcATCAGTOCAATGTTTGCAGGGTAATTCTTA 

AACAGTC^AAAAAa]^ 

CATGCACCCTACICAGTGTGGGTGTCTrCGTTrQQGQGCnTITrrrTTrrrrQC^^T 
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TCTGAGGCTAGATATGTCTGGCTGAAGATTT^^ 
GTCCTGTTAATAATAGGTACTGTACTGGGCTCTGTGTAAGTGT 
ACCTATATTITAATACTGTTCeTAACATCT 
TTCATTrTATTCTTTGTAATTCTAGACACTAGAT^ 
5 TTTTTTAAAAAGGGATATA' 1 ' I ' 1" 1U i' 1GC AC AGTTGTTCAAAAAAGAG ACAAQITTC 
AGTCCTCAATGCTGTCCTTTGTTCT^ 

ATGAAAAACTGTAGACTATTCTCAAGGTATTAACra3CAGACCCTCT 
GGGCTGTTTTCTAAGTTACAGGCAGAGTGGGACTGAGATGGTACAGTGTGCACAG 
ACAGGTACTGAGCTGACAGACTGGGATTTTC^ 
"1TT7>[^GTrffl^ACAGGeTTT^ 

SEQ ID NO: 2 

MPSDSESSSSLSSVGTTCKAFSPmiTBQQ^ 

ATHGTATOKRPYHRIJ^POYEP^ 
13 YHTUKXUXSDQTRIJ^DVDM^^ 

I^AQQQQVIJvHNGSSEIIJ^J^ 

LEQIXffiENQRIXKQflEMLTQQRDTAIQLQHQC^ 

WEMBLLQSELTBIJlTTQVKrAKESBKYREERDAVYSEY 

BVELAESKLI^STSEKKAANEBMEAI^ 
20 lJHiALQBADVARC^ 

DDTRKQK3TOVS1^LKH[^^ 

EISEUffTEDIDLKADSHDMABGVNEPCEPGDCGIF^ 

DVDLIIsnKDKKQAIKALLNGEGAI^^ GISLEN 
GVYAAAVI^GSPAAKBGSIAVGDRrvrAINGlA^ 
25 WQSSSWSGQMEBNIiaiSDKMI^ElUHGPBVQAH^ 

BDRKEPGPPGGSSSHJIKOTGGPLQVCPQACPSAJERSLSSI^DASGDRGPGLVDVR 

GMa*LLPFBlBVGPCGVGEA3LD^ 

QRKSITOP>rrFKRPQTO 

ETSSESEATLVGSSPSTSPPSAU'l^VDrcEPMHASPPRXARViaASSYVl^GlJGGSSH 
30 l^AKKSCDEDLTSQKVDELGQin^ 
SPELQBWAPYSPGHSSl^SNEPLYP^ 
FCSSPFAARDAGPQGUIPSVQHQGI^II>I3HR^^ 
SS^QFKAEPJKIPSTPRYPRSWGSERGSVSH^ 
LPRIAVI>flV\SIXjERWa}RPYVEBP^ 
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HQAGLEYGDQLLBFNGIMJRSATBQQARIJIGQCJCDTrn^ 

HLDPAGTHSTLQGSGTTTPEHPSVTDPLlSffiQDEGPSTPPAK^ 
VVHKKSQIBmVHLCCK^^ 

TVBEVYVBMUO^GVI^^ 
5 KDDILYVDDTLPQGTFGSVi'MAWQIJ^ENAQiaQRGQIPSKYVMD 

ddnsatk^i^aaarrsffrrkhkhkrsgskdgkdlla 

RVQK^TX^AlJlPYljlXSPIXDVVKEMLVl^ 

glfvdyke^sghfdvttva^ikb^ 

KHKBQKDPIYI^KVTQRHSIEQ]^QKLBC^YSRYFrGVIQGG 
10 VNQEQNKVLWIPACPL 

SEQ ID KIQ; fl 

SAGGCGGGAGGCGGCAAGGCGQAGCTGCTGCTCAAGCTGCTCTTGGCCAAGGAGCGQ 
GACCACTTCCAGGACC7X3CGGGCGGCGCTGGAGAAGACGCAGCCTCACCTGCTGCCC 
15 ATTCTCTACCTGAACGGCGTCGTCGGGCCGCCGCAGCCCGCCGAAGGCGCGG 

SEQ (PNQ:4 

GTCTCAACTTAAACTCCAGCACCACGACCCTACTACTATCTCGCACCTGAAACAAGCTAA 
CATGACTAACACCCTTAATTCCATCCACCCTCC 

20 

SEQ ID MO: ft 

GGGGCTGTGGCCACGGGGGAGCGGGTGGCGGCCGCGGGGCC0CGGTGAGCC5GGGC 
CGGGGTGCACGGGGGCCGAGGCGGCCTCCGAGCCGGGOCGGAGCTGTCGGGAGCCT 
GGGAAACACGGCCCAGGCGACrrrCTCCCGGGAGTTGTAGTOTrGOTCTGGGGGCGG 

25 CGCGCGGCCCGGCTTTCGGGGCGAGTTTTCTCGCGGGGCGGGCQGCGCGCGCGGAG 
GCTGAGGGGGAGGGGGAGCATGCCCGGCTGGAGCCCGGCCTCGGGGCGCGCCCCG 
GCCGCGGTCCCCACCTCCGCOTOOTCCTTCTCCTCCTCCGTCTCCTCCTCCTCC^^ 
CAGTCCCGCCGCCGCGGCAGCCAACATGGCTGCGCTCCGGAGCCCGGGAGGCGGCC 
GCTGCTGAGCGCCGCCCGCCGCCOOGGGWXCCAGCCCGCCCAGGCCCCGGCCCAT 

30 GGCTCCCGCCGCCTCACGGCTCCGGGCGCCCTAGQGCTGCTGCCGAGGTGCTCGGCT 
GCCCGCCCGCCGGCGTGGGGGGGOCCGGGAGGCGTCC 
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